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Preface 


What This Book Is, and Isn’t 


This book aims to do what its title suggests: help students—especially those new 
to real analysis—treally understand what it is all about. 

Who could oppose such a goal? A better question is how, concretely, this 
textbook tries to achieve it. Following are some more specific goals, strategies, 
and features. 


Building on calculus basics. Students in the beginning real analysis course that 
this book supports may have little or no course experience beyond single-variable 
calculus (a year, say) and perhaps some exposure to linear algebra, differential 
equations, or multivariate calculus. What can be expected from such students is a 
general, though probably informal, sense of the big ideas of calculus—function, 
limit, derivative, and integral—and some curiosity about the rigorous theory that 
lies behind the techniques. This book’s main strategy is to exploit students’ prior 
experience with and curiosity about calculus in working toward deeper under- 
standing. 


Focus on mathematical proof and structure. Beginning real analysis students 
may or may not already have taken a “proofs” or “bridge” or “transition” course. 
This book makes no hard-wired assumption either way. The long and relatively 
leisurely Chapter 1 includes enough of this material (e.g., a fairly complete in- 
troduction to sets and cardinality) to help less “mature” students make their way 
efficiently into real analysis. A class of students who do have this background can 
simply move more quickly through these preliminaries; there is enough material 
here to support a full semester course in either case. 

Real analysis is, arguably, a challenging area in which to learn proof tech- 
niques, as I hope many students will do. Definitions and theorems, for instance, 
can be quite technical, with multiple layers of quantification. On the flip side (I’m 
convinced!), real analysis has the advantage of being about ideas most students 
will have recently encountered—and many students really are eager to accept a 
challenge so closely connected to their recent experience. 


Focus on mathematical language. Outsiders and newcomers find mathematical 
language surprisingly subtle, sophisticated, and sensitive to word order, usage, 
and punctuation. (Run the definition of uniform continuity by a colleague in the 
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humanities sometime.) Most mathematics students need guidance and explicit 
attention to linguistic conventions in the discipline. This book aims to offer such 
help, especially in earlier sections, before the mathematical and linguistic subtlety 
ramp up—together. 


Redoing some elementary calculus topics “right.” A beginning analysis course 
can help students deepen and consolidate their understanding of calculus ideas. 
For example, few students seem to make deep sense of infinite series on that first 
dreaded pass, usually in Calculus 2. Students who study sequence convergence 
carefully, as in the course envisioned here, are in a far better position to make 
sense of abstract questions of series convergence and divergence and to go deeper 
into such topics as estimation of series “tails.” 


Focus on the basics. This book focuses on (what I take to be) basic elements of 
the theory; neither compactness nor the Lebesgue integral is covered, for instance. 
This is intentional: I’ve willingly traded “coverage” for “simplicity.” (Instructors 
who want to go deeper can readily do so; see below for suggestions.) Covering 
fewer topics, moreover, leaves room for more narrative discussion and concrete 
examples, of which there are a lot. To put it another way, I try to look closely at 
relatively simple things. 


Many examples, many solutions. The book contains many worked-out exam- 
ples and quite a few detailed solutions to exercises, especially in earlier sections. 
I believe that many students learn theory largely “inductively,” from examples, 
and that detailed solutions to a substantial number of problems can usefully illus- 
trate, for newcomers, the language and conventions of mathematical discourse. 


Reader-friendly style. I’ve aimed for a friendly-but-serious narrative style. The 
marginal notes, for instance, are intended mainly as brief nudges, pointers to “dan- 
gerous curves,” or just friendly asides. It takes two to achieve friendliness, of 
course, so the verdict is ultimately up to the reader. 


Increasing sophistication. Mathematical depth and technical sophistication grad- 
ually increase over the chapters. Chapter 1, for instance, is quite leisurely, while 
Chapter 5, on the Riemann integral, is relatively technical (like the integral itself) 
and business-like. 


Using This Book 


Following are some suggestions for using this book. 
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A one-semester course? This book is designed for use in one typical college 
semester, but there is more material here than I would cover in that time, except 
perhaps with veterans of a proofs course. With a less experienced class I might 
omit the coverage of infinite series, for instance, to concentrate more heavily on 
integrals. Or one could omit integrals entirely, and go into more depth on series, 
sequences, and derivatives. 


Supplementary topics. Real analysis offers many possibilities for group or in- 
dividual projects on supplementary topics. These can be especially useful with 
unusually well-prepared or highly talented students, who can otherwise be bored 
in a class of less highly selected students. Here are some ideas; all are readily 
researchable in books or online by motivated students: 


¢ The Cantor—Banach—Bernstein theorem: if f: A — Bandg: B — Aare 
injective, then there is a bijectionh: A — B. 


¢ Exploring the limit superior and the limit inferior (a “guided discovery” on 
this topic appears in the book). 


¢ Exploring Taylor polynomials and remainder theorems (Taylor’s theorem 
is mentioned, but only briefly). 


¢ Exploring compactness of subsets of R, the Heine—Borel theorem, and con- 
nections with boundedness of functions and convergence of sequences. 


° Exploring deeper properties of the Riemann integral. Suppose, for instance, 
that a function f is integrable on [0,1] and f(x) > 0 for all «. Showing 
rigorously that if f > 0 is harder than it might seem, and it raises good 
questions about monotonicity, points of continuity, and integrability. 


Solutions. Solutions to a substantial subset of the exercises are included at the 
end of the book. These include solutions to most of the odd-numbered problems 
and a much smaller subset of even-numbered problems. For more complete solu- 
tions, instructors should contact marketing @akpeters.com. 


Thanks 


This book owes its existence to a large (uncounted but surely countable) set of 
teachers, academic colleagues, St. Olaf College students, publishing company 
professionals, friends, advisors, critics, “competitors,” family, and others. It is 
a pleasure to acknowledge some of them by name—and to claim sole credit for 
errors that survived their best efforts to help. 

Among local colleagues I thank Bruce Hanson, Paul Humke, Loren Larson, 
Arnie Ostebee, Matt Richey, Lynn Steen, Ted Vessey, and many others with whom 
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Ihave discussed matters mathematical and pedagogical for many years. On mat- 
ters of taste and usage—both linguistic and mathematical—my friend Barry Cipra 
is areliable resource. His advice, when I have taken it, has always proved correct. 
These gifted teachers, expositors, and mathematicians have helped me think about 
the deep connections and subtle relationships among teaching, telling, and doing 
mathematics. They have also demonstrated, by argument and example, that seri- 
ous engagement with post-calculus mathematics is possible, and valuable, for a 
broad range of students, not just a small elite. This “big tent” approach to our dis- 
cipline informs and underpins St. Olaf’s very successful undergraduate program, 
and I have kept it in mind when writing this book. 

Academic colleagues elsewhere have taught me a lot, too. These include my 
own mathematics teachers at Washington University and the University of Wash- 
ington; authors and referees from my work with Mathematics Magazine and other 
publications of the Mathematical Association of America; and authors of other 
real analysis textbooks from which I have learned, taught, or read. Real analysis 
can be viewed usefully from many angles; doing so offers depth and perspective. 

St. Olaf College mathematics major students deserve thanks, too. Several 
cohorts of students in sections of Math 244: Elementary Real Analysis have used 
drafts of this book and offered valuable suggestions, criticisms, and pointers to 
typos—and worse. 

Klaus Peters and Charlotte Henderson, at A K Peters, have worked with re- 
markable (indeed, unprecedented in my experience) care, diligence, and intelli- 
gence to make this book better and its production easier. I’m all the more grateful 
to them because, as a sometime editor myself, I know how difficult and exacting 
that work can be. 

Last, but hardly least important, are the support and forbearance of my wife, 
Janet, in everything that book writing entails for anyone unlucky enough to be 
around the process. Thanks. 


Preliminaries: Numbers, Sets, 
Proofs, and Bounds 


1.1 Numbers 101: The Very Basics 


Real analysis is all about real-valued functions of a real variable, so an obvious 
early step is to understand basic properties of the set of real numbers. Many of 
these properties are related to or inherited from slightly simpler sets, such as the 
integers and the rational numbers. We reserve some special symbols for these 
important sets:* 


} the natural numbers; 
} the integers; 

Q: _ the rational numbers; 
R 


the real numbers. 


Using these symbols we can discuss these sets clearly and efficiently, with sen- 
tences like the following: 


42 
NCZCQCR; F,€Q,  V152399025<¢Z; VOB ER\Q. 


What each of these symbol strings says should be clear. Whether each claim is 
true or false is less obvious—but that’s a matter of mathematics, not of symbolic 
clarity. As it turns out, all four claims are true; proving the last assertion takes 
some effort. 

Set notation is useful and efficient, but only when used correctly. Something 
is amiss, for instance, with each of the following expressions: 


@MREQ; G@31DN; GiVNCR. 


*The special font can help avoid confusion with other sets. The choices N and R are made for 
obvious reasons; Q and Z remind us of quotient and Zahlen (German for “numbers”), respectively. 
The symbol C can denote the set of complex numbers. 


The last one says that 93 is 
real but not rational. Notice the 
funny “setminus’” sign. 


We'll see a proof soon; for now 
the proof is beside the point. 


if 108m were a quotient of 
integers, 7 would be rational, 
too, which it isn’t. 


It's f/(7r), as we know from 
beginning calculus. 
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A charitable reader might try to make sense of (i)—(ili), but their problems are 
ultimately fatal—none really makes sense. (A nicer way to say this is to refer to 
errors of syntax.) The problem with (i), for instance, is not the (true!) fact that 
real numbers may be irrational, but rather that R is a set of numbers, and hence 
not even in the running to be an element of the set of rationals. Expression (ii) is 
meaningless for similar reasons, and (iii) is even worse: the expression VN has 
no clear meaning, so the question of containment in R is moot. 


EXAMPLE 1. Does each of the following symbolic claims make sense? If a 
claim makes sense, is it true or false? 


@MQER; (ii) 10°x € Q; (iii) ifa € Qandbe Qthena+beEQ. 


SOLUTION. Statement (i) makes no sense—Q is the wrong type of object to 
be an element of R. Claim (ii) asserts—meaningfully but falsely—that 10°7 is 
rational. Claim (iii) also makes good sense; it says that the sum of two rational 
numbers is another rational number. This, too, is true, because the sum of two 
“fractions” is another “fraction.” ©) 


Integers, Rationals, and Reals 


What exactly is a real number? 
The question may seem absurd. Nobody reading this book will balk at expres- 
sions like 


_ (mt) _ v2 
sin (7) : 


1 
= — and [ faqe-1, 
0 
both of which are rich stews of integers, fractions, and irrational numbers. We 
will not deny or ignore your earlier experience—all of it will be useful—but aim 
to sharpen intuition and make assumptions explicit. 
As a look ahead consider, for instance, the expression 


which describes a certain derivative. Making clear sense of such an expression, 
as we’ll need to do later in studying derivatives, depends on various properties of 
real numbers. Here are some early hints: 


® Real arithmetic: The fractional expression 


f(x) = f(x) 


v-T 
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is built up using subtraction and division of the real numbers x, 7, f(z), 
and f(z), at least some of which are irrational. Depending on its definition, 
the function f may require additional arithmetic operations on reals. 


© Approximation and distance: The idea of limit entails one quantity “ap- 
proaching” another, in a subtle sense that we’ll soon work hard to make 
precise. But it is clear already that we won’t get far without some notion of 
“distance” between numbers, and we’ll need to know that any real number, 
say 7, has plenty of “near” neighbors along which an “approach” to 7 can 
be made. 


° No gaps: For any limit to exist there needs to be a value for the quantity in 
question to approach; if the real numbers had “gaps” there could be trou- 
ble. This modest-seeming requirement, called completeness, turns out to be 
subtler than it might first seem. We’ ll revisit it soon. 


Integers. We’ll build up our description of the real numbers by starting with sim- 
pler sets: first Z and then Q. What’s “simpler” depends, of course, on the situa- 
tion. To a number theorist, for instance, there is nothing simple about the integers. 
In real analysis, however, we’ll need only basic, familiar properties of the in- 
tegers, and we’ll usually take these as “known.” We’ll accept without fuss, for 
instance, that equations like 


a+b=b+a, ab = ba, (a+ b)c=ac+bc, and 
(3a — 5b)(2a — 7b) = 6a? — 31ab + 35d? 


make sense, and hold true, for any integers a, b, and c. We also “know” that every 
integer a has an additive inverse, —a, but that a multiplicative inverse, 1/a, need 
not be an integer. 


The well-ordering property. Here’s another basic fact we’ll just assume: Every 
nonempty set of positive integers has a least element. This fact is known formally 
as the well-ordering property of the positive integers. The name may be myste- 
rious but the property itself should be familiar and believable from experience. 
Among all even positive multiples of 7, for instance, 14 is smallest. We’ll take 
the well-ordering property as an axiom, not as a theorem to be proved. 


Rational numbers. Basic algebraic properties of Q are equally familiar. All four 
equations above, for example, hold just as well for rational numbers a, b, and 
cas for integers. Because nonzero rational numbers have (rational) reciprocals, 
equations and expressions that involve division can now make sense. Here are 
three examples: 


What happens if a = 0? 


It comes from set theory—not 
our subject here. 


In more formal discussion the 
property is sometimes proved to 
follow from even simpler 
axioms. 
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5 0 SSH. 


DW 2 Be 2g pq_ 4g prg ots 4A 
if r pag iP 


3 


All are true, of course, whenever p, q, and r are rational numbers and all denomi- 
nators are nonzero. 

In later work we'll freely use such familiar facts, usually without comment. 
Right now it is a good exercise to derive some properties of Q, assuming basic 
properties of Z. 

For us a rational number r is just a ratio of integers: 

— 


where a,b € Zand b # 0. 


a 
Bb? 
Of course, any given rational number has many possible forms: 


2 14 —222 84 


3.9 Dir ws8ae «igoe 


the first form, in which numerator and denominator have no common factors, is 
called reduced. 


EXAMPLE 2. Prove—assuming basic properties of integer arithmetic—that if p 
and q are any rational numbers, thenp + ¢ € Qandp+q=q+p. 


SOLUTION. The proof is direct: We start with the hypothesis and derive the 
conclusions. Because p and q are rational, we can write 


a @ d a, 
Drape ae: 


where a, b,c,d € Zand b 4 0 4 d. Now we add fractions: 


is 2G Oo 
a rT aaa rae 


which exhibits p + q as the desired ratio of integers. (Both numerator and de- 
nominator are integers, and the denominator is nonzero, thanks to basic integer 
arithmetic, which we take as known.) 


Also, 
_—¢ a_ch+ad_— ad+cb 
LTR gy dike > ee 
where the last equality follows from commutativity of integer operations. The last 
expression is just what we calculated earlier for p + q, so we’re done. © 


EXAMPLE 3. Prove that if p € Q and p ¥ 0, then there is a rational number q 
with pg = 1. (In other words, q deserves the name 1/p.) 
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SOLUTION. We’ll give an explicit formula for q. 
The hypothesis means that there are nonzero integers a and b with p = a/b. 
Now gq = b/a does the job, since 


2A od 
Le rea % 


Q is a field. The following theorem says, in mathematical parlance, that Q is a 
field. 


Theorem 1.1. The set Q has the following properties: 


© Two operations: Addition and multiplication are operations on Q: if p and 
q are any rational numbers, then so are p + q and pq. 


* Commutativity: Addition and multiplication are commutative operations: 
if p and q are any rational numbers, thenp + q=q+p and pq = qp. 


* Associative operations: Addition and multiplication are associative opera- 
tions: if p, g, and r are any rational numbers, then (p+q)+r = p+(qtr) 
and (pq)r = pqr. 


® Identities: The rational number zero is an additive identity: 0 + p = p 
holds for all p € Q. The rational number one is a multiplicative identity: 
lp = pholds forallp € Q 


* Inverses: For every rational number p, the rational number —p is an ad- 
ditive inverse: p + (—p) = 0. For every nonzero rational number p, the 
rational number 1/p is a multiplicative inverse: p-1/p = 1. 


® Distributivity: Multiplication distributes across addition: For any rationals 
p,q, and r, p(q +r) = pq + pr. 


All parts of the theorem can be proved in the spirit of the preceding examples, 
assuming similar properties of Z. But... 


EXAMPLE 4. Zis nota field. Why not? 


SOLUTION. To be a field, a set must satisfy each of the long list of properties 
in Theorem 1.1. Proving all this can be daunting. But there’s a happy flip side: 
Disproving that Z is a field is easy. It is enough to find just one property that fails 
for Z, and for this even a single counterexample suffices. We could just observe, 
for instance, that the integer 42 has no multiplicative inverse among the integers, 
and leave it at that. ©) 


Nothing in the theorem should 
surprise you; the point is to 
collect and formalize key 
properties. 


Like other proofs we've given, 
this one exploits basic 
properties of integers. 


See the exercises. 
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QisnotR. The set Q is clearly infinite—it contains all the integers, for one thing. 
A bit less obvious, perhaps, is the fact that between any two different rationals lie 
infinitely many more rationals. Between 3.14 and 3.15, for instance, lie 


3.141, 3.142, ..., 3.149, 3.1411, 3.1412, ..., 3.14237568, 3.14237569, ... 


and so on. With so many rational numbers around, one might wonder whether 
there are any irrationals—numbers that cannot be written as ratios of integers. 
Nowadays everybody “knows” the answer: J/2, V3, </2, 7, e, and many other 
favorite numbers are all irrational. But none of these facts is completely triv- 
ial. Indeed, the Pythagoreans (around 500 BCE) saw all numbers as rational. 
By some accounts, they drowned the philosopher Hippasus as a heretic after he 
demonstrated that a square of side 1 has irrational diagonal. We’ll take that risk 
and still give a proof. 


Theorem 1.2. \/2 is irrational. 


Proof: The proof is by contradiction: Assuming V2 is rational, we’ll derive an 
absurdity. 

Assume, then, that ./2 is rational. We can write /2 = oe for some positive 
integers a and b, not both even. (If both were even we could cancel a common 
factor of 2.) Next we do some algebra: 


a a 2. 2 
V2 => 2=5 2b? = a2. 


Now 20? is certainly even, and so a” must be even, too. But then a itself must be 


even; otherwise, a? would be odd. Thus we can write a = 2c for some integer c, 
and we have 
2b? = a? = 4c? 6? = 2c?. 


This implies that b (like a) is even, which contradicts our earlier assumption, and 
completes the proof. 


A more general theorem can be proved by similar means: 
Theorem 1.3. ./n is irrational for every integer n that is not a perfect square. 


Infinitely many irrationals for the price of one. It took some work to find even 
one irrational number. This done, however, it is surprisingly easy to find many, 
many more irrationals. 


Theorem 1.4. If x is irrational and r # 0 is rational, then 


xL 
ar, utr, “w-r, Ifa, -, and — 
r xL 


are all irrational. 
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Proof: There is less to this than meets the eye. Everything boils down to the 
field properties of Q, listed in Theorem 1.1. For example, suppose (aiming for 
a contradiction) that xr is rational. We know from Theorem 1.1 that 1/r is also 
rational, and so the product 


wrx-=2 
r 


is rational, too. This contradicts our hypothesis, and so xr must be irrational. The 
remaining parts are similar, and are left as exercises. 


Real numbers. So what is a real number? 

For us, the set R is the union of all the rational and all the irrational numbers. 
Admittedly, this is not a logically airtight definition. It ignores, for example, such 
difficult questions as how to be sure we’ve included all the needed irrationals. But 
our informal view is a good start, and we’ll deepen it in the next section as we 
study properties and alternative ways of viewing the real numbers. 


Exercises 


1. As in Example 1, page 2, decide whether each of the following claims 
makes sense. If a claim makes sense, is it true or false? 
(a) Ifr € Qthen 5r € Q. 
(b) V8CR\Q. 
(c) {V8} CR\Q. 
(d) Ifa € QandbER\ QthenabeER\Q. 
(e) Ifa € Qandbe Nthena/bEQ. 
(f) If a € Q then there exists n € N such that na? > 100. 
2. We said in this section that Z is not a field. Which of the several require- 


ments in Theorem 1.1 does Z fail? Which does it pass? Give an example to 
illustrate each failed requirement. 


3. For which integers a is 1 /a an integer? For which integers a is 1/a rational? 
For which integers a is 1/a = a? No proofs necessary, but be sure to 
consider all possibilities. 


4. The set of irrationals is not a field because (among many other reasons) the 
irrationals contain no multiplicative identity. Give one reason—as briefly 
as possible—why each of the following sets is nota field. 


The reals can be defined 
rigorously, starting from the 
rationals; this is sometimes 
done in more advanced 
courses. 
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(a) The set S; of nonnegative rationals. 
(b) The set S_ that includes all the irrationals and all the integers. 


(c) The set $3 of all numbers of the form a + b\2, where a and b are 
integers. 


5. Let p and q be two rational numbers. In each part following, decide whether 
the given expression must be a rational number. If so, explain why, referring 
to any theorems in this section. If not, give a counterexample. 


pt 
2 
Prd 


0) ae 


(c) Vp? +¢ 
(d) Vp? + 2pq + q? 


6. The well-ordering property (see page 3) says something special about the 
set N: 


(a) Z (the average of p and q) 


If S CNand S ¥ 9, then S has a least element. 
Give informal answers, not formal proofs, in each part following. 


(a) Does Q have the well-ordering property? (In other words, does every 
nonempty subset of Q have a least element?) Why or why not? 


(b) Does the set R = {1, 10, 100, 1000, ...} have the well-ordering 
property? Why or why not? 

(c) Does the set T = { —3, —2, —1, ..., 41, 42 } have the well-ordering 
property? Why or why not? 

(d) Replace the word “least” with “greatest” in the well-ordering property 
above. Does the result hold for N? For T'? For Z \ N? 


7. We know from elementary calculus that In n tends to infinity (“blows up”) 
slowly as n tends to infinity. It follows that In In 1n n also tends to infinity, 
but slower still. Use the well-ordering property to explain why there must 
be a least positive integer no such that InInIn np > 2. (Notes: Finding no 
exactly would be tricky, since no is very, very large. For fun, try to estimate 
how many digits no has.) 


8. Let both x and r be irrational numbers, and consider the quantities xr, «+7, 
x —r and x/r, as in Theorem 1.4. Must any quantity be irrational? Must 
any be rational? No proofs needed, but give examples to illustrate all the 
possibilities. 
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. Imitate the proof of Theorem 1.2 to show that 3 is irrational. (Hint: If a? 


is divisible by 3, then a is divisible by 3, too. This is easy to prove, but just 
assume it here.) 


. Use Theorem 1.4 in this problem. 


(a) Show that if x? is irrational, then z is irrational, too. 
(b) We know that J/2 and V3 are irrational. Show that J2 + V3 is irra- 
tional, too. 


(c) All of /2, V3, and v/5 are irrational. Show that /2 + /3 + V5 is 
irrational, too. (This is harder.) 


. Add details (explain anything that isn’t obvious) to the following hints to 


give another proof that \/2 is irrational. (The proof’s very clever idea is 
attributed to Fermat.) 

If /2 = a/b holds for any positive integers a and b, then there must be a 
smallest possible b, in the sense that no smaller denominator works. Let 
such a and b be given; note that 2b? = a?. Now consider the new fraction 


2b-a_ a’ 
a—b bv 
Explain why (i) both a’ and b’ are positive integers; (ii) b’ < b; (iii) 


ae 
(+) = 2, and so deduce that /2 = a/b is impossible. 


. The set M2x2 of 2 x 2 matrices with real number entries permits matrix 


addition and matrix multiplication, so we can ask about the properties men- 
tioned in Theorem 1.1. No proofs needed, but give examples to illustrate 
any properties that fail. 


(a) Are addition and multiplication in M2... commutative? 


(b) Which elements in M2,.2 have additive and/or multiplicative inverses? 


(c) Does addition and multiplication in M2,.9 satisfy the distributive prop- 
erty? 


. Let F be the set of real numbers of the form a + b\/2, with a and b ra- 


tional numbers. Show that F' is a field. (Hint: The hardest part concerns 
multiplicative inverses.) 


. Consider the set Zz = {0, 1}. 


(a) Show that Ze is not a field with the usual operations of multiplication 
and addition. 

(b) Show that Z, is a field if we use the two operations (i) ordinary mul- 
tiplication; (i) addition “mod 2”: 0 +0=0Oand0+1=1+0=1, 
but1+1=0. 


Are they all true of the sets 
above? 
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1.2 Sets 101: Getting Started 


This section is a brief introduction to (or review of) rudiments of set theory that 
play some role in real analysis. An important purpose is to establish vocabulary 
and notations that will help us streamline later discussions. 

A set is any collection of objects, called elements of the set. Here are some 
samples: 


A = {January, February, March, ..., December} , 
B = {January, March, May, July, August, October, December} , 
C= {1, {2,3}, a7 + 1, November, elephant} : 


D= 2,3}, 
I= (2,3) ={4#|2<2 <3}, 
NE 993). 


(—) = the empty set. 
Observe: 


¢ Names: It is common, but not required, to use uppercase letters to denote 
sets. For a few very special sets, like N and ) above, we use special charac- 
ters. 


® Looks can deceive: The sets D = {2,3} and I = [2,3] bear some typo- 
graphical resemblance but are otherwise very different—D has only two 
elements while J, like every nonempty interval, has infinitely many. 


Different descriptions: Sets can be described in various ways. For B, C, 
and D, for instance, we listed each set’s elements explicitly. For A, J, and 
N we described a pattern or condition that elements must satisfy. What 
really matters is clarity. 


Weird sets: The strange set C illustrates that a set’s elements may be quite 
different from each other. In particular, a set may contain other sets as ele- 
ments. We’ll usually stick to relatively tame sets in studying real analysis. 


Notations and operations. Standard symbols let us describe set properties and 
operations clearly and concisely. Just a few go a long way: 


© Subsets and containment: The expression B C A says (truthfully for the 
sets above) that B is a subset of A; the C-like symbol suggests containment. 
If B C¢ A, then B is a proper subset of A. 


In a similar spirit, all of these expressions: 


ADB; BCA; BCA; IQDD; ) 


IN 
v 


describe various containment relations. 
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© Membership: The expression a € A means that a is an element of A. Here 
are some (true) examples for the sets above: 


March € A; 2€ D; April ¢ B; {2,3} €C; {2,3} € I. 


The last two claims might be surprising. Notice that the set {2, 3} is indeed 
an element of the peculiar set C’, but not of J, which contains only numbers. 


New sets from old: Any two sets S and T can be combined in various ways 
to form new sets: 


union: SUT={a|xeSorxeT}; 
intersection: SAT={x|x€ SandxreT}; 
set difference: S\T={x|x€ Sanda ¢T}; 
Cartesian product: SxT={(«#,y)|x«eSandyeT}. 


Here are some (true) examples from the sets above; note that all of the 
results are sets: 


ANC = {November}; AUB=A; ANB=B; 
A\ B = {February, April, June, September, November} ; B\A=9; 


Ax N= {(m,n) | misa month and n is a positive integer} . 


In practice, statements about sets may involve complex symbolic combinations. 
Making sense of them takes careful reading. 


EXAMPLE 1. Let Z, Q, and R denote (as usual) the sets of integers, rationals, 
and reals, respectively, and consider the sets 


S={xeR|a*-x-1=0}; T={reER|2?—-x-1>5}. 


The sets S and T illustrate set-builder notation: they are “built” from a larger set 
using a selection rule or membership criterion. (The vertical bar | means some- 
thing like “such that.”) 

What does each of the following assertions mean? Which are true? 


Gi) S €Z; (ii) S CQ; (iii) SC R\Q; (iv) 4 € T. 


SOLUTION. Statement (i) says that S—a set of numbers—is itself an integer; 
this is clearly false. Statements (ii) and (iii) make better sense; (ii) says that all 
roots of 2? — a —1 are rational, while (iii) says that these same roots are irrational. 
Who’s right? Well, by the quadratic formula, the two roots are (1 + V5) /2, both 


Read € as “in.” 


On the other hand, {2,3} C /. 


Check for yourself. 


1. Preliminaries: Numbers, Sets, Proofs, and Bounds 


of which are real but irrational, so (iii) is true and (ii) is false. Statement (iv) boils 
down to the claim that 4? — 4 — 1 > 5, which is clearly true. 

We might, by the way, have saved some work by first rewriting S and T’ in 
simpler or different forms. As we’ve seen, 


s- [4 4h 


2 2 


The quadratic formula also shows that x? — 2 — 1 = 5 has the two roots x = —2 
and x = 3, and so 
T={t#ER|x<-2ora>3}. .) 


Intervals. Intervals in the real line are familiar but useful sets in studying real 
analysis. Calculus veterans have seen countless examples; we collect a few as 
reminders of the possible variety and of some useful descriptive language. 


interval description inequality form 
[—2, 3] closed, bounded {tER|-2<2< 3} 
(—2,3) open, bounded {e#ER|-2 <2 <3} 
[—2,00) closed, unbounded {xER|-2< x} 


(—oo,3) — open, unbounded {rER|a<3} 
(—2,3] half-open, bounded {x €R|-2<2<3}. 


In particular, closed intervals contain their endpoints, if any, while open in- 
tervals do not. All intervals, open or closed, bounded or unbounded, share two 
defining properties: 


Definition 1.5. A set J C R is an interval if (1) I contains at least two points; 
(ii) if a and b arein J anda < x < b, then x € I, too. 


EXAMPLE 2. Let J and J be any two intervals. What possible forms can the 
intersection J J take? 


SOLUTION. The intervals J and J might miss each other entirely; then JNJ = 0. 
Or I and J might intersect in a single point, as do J = [1,3] and J = [8,5]. More 
interesting is the fact that only one other possibility exists: If J contains at 
least two points, then J J is an interval. 

The hard way to prove this is to handle many special cases, depending on 
whether each of J and J is open, closed, bounded, unbounded, etc. The easy way 
is to use Definition 1.5, in which only (ii) is a live question. 
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Suppose, then, that both a and 6 are in JM J, anda < x < b. Since J is an 
interval, Definition 1.5 guarantees that x € J. For the same reason, we must have 
x € J, too. Thus x € IM J, and we’re done. ©) 


Exercises 


1. Consider several sets discussed in this section: 


A = {January, February, March, ..., December} ; 
B = {January, March, May, July, August, October, December} ; 
C= {1, {2,3}, x? + 1, November, elephant} : 
D= {2,3}; F=([2,3)={x#|2<a<3}. 

(a) Clearly, B C A. Are there any other subset relations among A, B, C, 
D, and I? Is any of these sets an element of another? 


(b) Find a rule or pattern to define B in terms of A. (We could use 
such a rule to write B in set-builder notation: B = {m € A | 
... something ... }. 


(c) Describe the set A x D. How many elements are there? 
(d) Find (list the elements) each of the following sets: A \ B; B \ A; 
ANC;BNA;DOI;DUT. 


2. Consider the sets S = {x €R| a? +2 =0}andT = {x eR | x7+ 
x <5}. (We used similar sets in Example 1, page 11.) 


(a) Rewrite S and T in simpler forms. (One is a finite set and the other 
an interval.) 


(b) Decide whether each of the following statements is true or false, and 
explain: SCN; SCT; TAQFO;-28E€Q\T. 


(c) Give the simplest possible description of the set U = {x € R | 27+ 
x <0}. 


3. Let A and B be subsets of R, and consider the two claims 


@) R\ (AUB) =(R\ A)U(R\ 8B); 
(i) R\ (AUB) =(R\A)N(R\B). 
(a) One of (i) and (ii) is true and the other false. Identify the false claim 
and give specific sets A and B to show it is false. 


(b) What happens in the special case that A = B? 


11. 
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. Give specific examples of intervals J and J for which I M J is (a) open; 


(b) closed; (c) half-open; (d) open and unbounded. Is it possible in each 
case to choose J and J so that neither J C J nor J C I? 


. Is it possible for both J and R \ I to be intervals? Can both J and R \ I be 


bounded intervals? In each case, give an example or explain why not. 


. Can any interval have exactly 123456789 points? Why or why not? 


. From any set S we can create the new set P(S), the power set of S, con- 


sisting of all subsets of S. If S = {1,2}, for instance, then P(S) = 
{0, {1}, {2}, {1,2} } 

(a) Find P(S) if S = {1,2,3}. 

(b) Show that if S C T, then P(S) Cc P(T). 


(c) Let Nig = {1,2,...,10} and Ni; = {1,2,...,10,11}. Explain 
why P(N) has twice as many members as P(Nio). (We'll prove 
this formally in a later section.) 


. Let Nip = {1,2,..., 10}. 


(a) Let S be the set of all three-member subsets of Nij. How many ele- 
ments does S have? 


(b) Let T be the set of all three-tuples (a, b,c) with a, b, and c in Nio. 
How many elements does T have? 


(c) Let S10 be the set of all permutations (i.e., orderings) of the elements 
of Nio. How many elements does $9 have? 


(d) Do any two of the sets Nio, S, T, S19 have nonempty intersection? 


. Let S' be a set with n elements. Let S’ be the set of all (n — 1)-element 


subsets of S’. How many elements does S’ have? Why? 


. Let Nioo = {1,2,..., 100}. Let Ago and Asg be the sets of all 42- and 


58-member subsets of Nio9, respectively. Show that A42 and Asg have the 
same number of elements. 


The xy-plane can be thought of as the Cartesian product R x R (hence the 
notation R?). Sketch each of the following subsets of R x R. 


(a) {1,2,3} xR 

(b) R x {1,2,3} 

(c) ZxXN 

@) {(2,y) |y=27} 
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(e) {(z,y) |v =sin y} 
@) {(@,y) | 2? +9? =—1} 


12. A simple picture can be drawn in a n x m square grid by coloring each 
square either black or white. Think of the grid as the Cartesian product 
G = H x V, where H = {1,2,...,n} and V = {1,2,...,m}, with 
lower-left corner (1, 1) and upper-right corner (n,m). 


(a) Suppose the set of black squares is {(z,y) | c+y=m+n+l}. 
Draw the picture. 


(b) Describe the set of black squares in a “checkerboard” picture (with 
lower-left square black). 


(c) What does the element (2, 3, black) in the set G x {black, white} rep- 
resent in this context? What does the set Gx {black, white} represent? 


(d) A picture in the sense above can be thought of as an element of P(G). 
Explain. 


1.3 Sets 102: The Idea of a Function 


In everyday mathematics we might discuss “the tangent function” or “the function 
f(x) = x?” Inacalculus course such language may be fine, but in other settings 
we’ll need a more general description of functions—using the language of sets. 


Definition 1.6. Let A and B be nonempty sets. A function f : A — Bisa rule 
for assigning to each “input” element a in A one and only one “output” element, 
f(a), in B. Here A is the domain of f and B is the codomain. 


In other words. Alternative language is available. A function f is sometimes 
called a mapping, and we say that f maps a domain element a to a codomain 
element b, sometimes called the image of a. In symbols, we might write f : a t> 
b, or just a + bif the function name is understood. 


EXAMPLE 1. How is the “calculus description” f(x) = x? related to the formal 
definition? 


SOLUTION. The formula f(x) = x? makes the rule perfectly clear: For inputs 
—3, V2, and 1.2345, the corresponding outputs are f(—3) = 9, f(/2) = 2, and 
f (1.2345) = 1.23457. The domain and the codomain, on the other hand, are open 
to choice. In a calculus course we might, if pressed, use the natural domain—the 
set of all real numbers for which the rule makes sense. For f(a) = «7, that’s R 


16 


1. Preliminaries: Numbers, Sets, Proofs, and Bounds 


itself. (For g(x) = tan(x), the natural domain omits some real numbers.) In a 
number theory course, we might use N as domain. 

The codomain, too, is open to choice. For a given domain, we need only as- 
sure that the codomain contains all possible outputs. With domain R, for instance, 
we could use as codomain for f(x) = x? any set that contains all the nonnegative 
reals. This might be R itself, the infinite interval [0,00), or something stranger, 
like (—42, 00) or C. With domain N, we could reuse N as codomain, or choose 
any set of integers that contains all positive perfect squares. © 


The moral. The preceding example shows that a function is more than a formula. 
A function is a 3-part package: a domain A, a codomain B, and a rule (which may 
or may not be a symbolic formula) for assigning a unique output b = f(a) in B to 
every input ain A. The notation f : A — B emphasizes this three-fold nature. In 
practice the domain or codomain or both are sometimes understood from context, 
or even ignored, but they’re always waiting in the wings. 


Range and codomain. The range (or image set) of a function f : A — Bis the 
set of all outputs: 

range of f = {f(a)|ae A}. 
Note that the range is always a subset—perhaps a proper subset—of the codomain. 
For f : R= R given by f(x) = x?, for instance, the range is the interval [0, oo), 
which omits all the negative numbers. 


Function examples. A good way to explore functions is through examples and 
non-examples, including some strange ones with strange names. (We’ll usually 
stick with one-letter function names, although longer, more descriptive names are 
perfectly legal.) 


EXAMPLE 2. Consider the sets 


A = {January, February, March, ..., December} ; 
Nie = {1,2,3,...,12}; P = {Alice, Bob, Carol, Dave} . 
Is each of the following a function? 


* MONTHNUMBER : A — Nj2, where MONTHNUMBER (January) = 1, 
MONTHNUMBER (February) = 2,..., MONTHNUMBER (December) = 12. 


® WORDLENGTH : A — Nji2, where WORDLENGTH(a) is the number of 
letters in a. 


® BIRTHMONTH : P — A, where BIRTHMONTH(p) is the month in which 
p was born. 
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° 7: P — Nip, where j(p) = MONTHNUMBER (BIRTHMONTH(p)). 


© SQUARE : Nig — No, where SQUARE(n) = n?. 


SOLUTION. All of MONTHNUMBER, WORDLENGTH, BIRTHMONTH, and 7 
are functions. In each case, every member of the domain is paired with one and 
only one member of the codomain. Some of the rules are admittedly peculiar. 
To find BIRTHMONTH(Alice) and j(Alice), for instance, we would need more 
information. Still, the rules make sense because everyone has a unique birth 
month. By contrast, SQUARE has a problem: Squaring members of Ni2 produces 
results outside Nj. ) 


Seeing Functions 


Graphs. Graphs are deservedly popular in elementary calculus. Properties like 
smoothness, steepness, rising vs. falling, concavity, and existence of asymptotes 
reflect and reveal a lot about functions. 

A less familiar fact is that, like functions, graphs can also be described in 
the language of sets. The graph of a calculus-style function, say, f(x) = 27, is 
a curve in the xy-plane, made up of points of the form (x, f(x))—in this case, 


(x, x). But the idea of a graph makes sense for any function: 


Definition 1.7. Let f : A — B bea function. The graph of f is the set 
G = {(a,b)|ae€ Aandb= f(a)}. 
Observe: 


° A graph is a set: The graph of f is a certain set of ordered pairs, and thus 
a subset of the Cartesian product A x B. 


© But not just any set: For f to be a function with domain A, its graph G 
must contain one and only one point (a,b) for each a € A. The graph of 
a function f : R — R, for instance, must contain exactly one point of the 
form (3, y). 


* Maybe acurve, maybe not: Graphs of calculus-style functions are often 
nice curves. Indeed, a lot of beginning calculus is about connecting ge- 
ometric properties of curves to analytic properties of functions. But some 
functions have graphs that are nothing like curves. The graph of 7 in Ex- 
ample 2, for instance, has points of the form (Carol, February)! 


° Other “graphs” out there: Like other useful math words, “graph” has dif- 
ferent meanings in different settings. The graph of an equation, for exam- 
ple, is the set of points (x, y) for which x and y satisfy the equation. The 


We could “fix” this problem by 
choosing a larger codomain, 
such as N. 


In pre-calculus lingo, this is the 
vertical line test. 
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graph of x? +y? = 1 is acircle, and therefore not the graph of any function. 
In this book, “graph” will refer only to functions. 


® One idea, two views: A function and its graph are so closely linked—either 

one completely determines the other—that functions are sometimes de- 

From this perspective, functions fined, rather than just visualized, as sets of ordered pairs. We’ll use both 
are sets in their own right. of these viewpoints freely. 


Other views. For some functions, geometric graphs make little sense, so we 
use other descriptive devices—tables, diagrams, etc. To describe the function 
BIRTHMONTH in Example 2, for instance, we could use a table: 


The BIRTHMONTH function 


We could also use a diagram to describe BIRTHMONTH, showing the domain, the 
codomain, and arrows connecting inputs p to their corresponding outputs h(p). 
(In such a view, the range is the set of “arrowheads,” where incoming arrows 
Draw your own diagram for “land.’) 
BIRTHMONTH. Yet another, even less formal notation is sometimes useful. To describe the 


MONTHNUMBER function, we could just write 
January'> 1, February'> 2, Marcht> 3, ..., Decembert> 12 


to convey the idea. (The ++ symbol is usually read as “maps to.’”) 


All Kinds of Functions 


Functions come in many varieties. We will now define and illustrate, mainly using 
“toy” functions, some properties a given function may or may not have. 


Definition 1.8 (One-to-one functions). A function f : A — B is one-to-one (or 

injective) if f(a.) # f(a2) whenever a, and az are in A and a; # az. In equiva- 
Convince yourself of the lent words: f maps different inputs to different outputs. In equivalent symbols: 
equivalence. 


f(a1) = f(az) = a, = a2. 


The next two examples will help illustrate the meaning of—and how to prove or 
disprove—injectivity. 
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EXAMPLE 3. The BIRTHMONTH function from Example 2 is not one-to-one, 
because BIRTHMONTH(Bob) = BIRTHMONTH(Carol) = December, even 
though Bob 4 Carol. By contrast, MONTHNUMBER, which maps each month 
to its number in the year, is one-to-one. The “inclusion” function: : Q — R 
with rule i(a) = x is one-to-one, since outputs are inputs. The constant function 
c: R > R with rule c(x) = 3 is at the opposite extreme—aill inputs are mapped 
to the same output. 

The linear function L : R — R given by L(x) = 3a + 5 is also one-to-one. 
To prove this, let’s check the equation in Definition 1.8, using some algebra: 


D(a1)= D(a) =, +5 =387Q4+5 = 8x1 = 322 
= v= £2, 


as desired. ©) 


Definition 1.9 (Onto functions). A function f : A — B is onto (or surjective) if, 
for every b € B, there is some a € A with f(a) = b. In equivalent words: Every 
element of the codomain is also in the range. In equivalent symbols: 


{f(a) |a€ A} =B. 


EXAMPLE 4. The function BIRTHMONTH from Example 2 is not onto, because 
no member of P was born in (say) March. The function MONTHNUMBERR : A > 
Nj2 is onto, because the 12 months range in number from | to 12. The “inclusion” 
i: Q — R with rule i(x) = z is not onto, because the codomain includes 
irrationals, but the range does not. 

Is the quadratic function g : R — R given by g(x) = x? — 6z surjective? 
The short answer is no: A little calculus or algebra shows that ¢(3) = —9 is the 
minimum value, so the range of q is the interval [—9, 00), a smaller set than the 
codomain. We could make q surjective by using [—9, co), not R, as the codomain. 
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Bijective functions. A function that is both injective and surjective is called bijec- 
tive, or, equivalently, a one-to-one correspondence. Our MONTHNUMBER func- 
tion is one bijection. Another is suggested by a few values: 


lra 


2b, 38rc ..., 2ry, 2Brz. 
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Here are two bijections from calculus, this time in a matched pair: 
Tie (-+ “) —R, withrule f(x) = tan(z) 


g:R=> Ge ) , withrule g(x) = tan™!(z). 


Finding even one pair of inputs 
that give the same output is 
enough. 


Using “onto” as an adjective 
sounds ugly, but everybody 
does it. 


Finding even one such 
codomain member is enough. 


What are the domain and 
codomain? What's a good 
name for this function? 
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This makes sense with respect 
to nested parentheses—but it 
goes against the usual grain of 
reading from left to right. 
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You can readily convince yourself, perhaps with graphs, that f and g are indeed 
one-to-one and onto. But notice a little surprise: f and g are one-to-one cor- 
respondences between an interval of finite length and the entire real line. Such 
strange behavior is possible when infinite sets are involved; we will explore infi- 
nite sets further in a later section. 


New Functions from Old 


Elementary calculus courses are full of complicated functions, like 
f(a) =sina + (14+ e”)sin (x + e”), 


concocted in various ways from simpler ingredients. Tools like the product rule, 
the chain rule, and u-substitution help us find derivatives and antiderivatives of 
such built-up functions as appropriate combinations of simpler derivatives and 
antiderivatives. 


Composition. Among these familiar ways to combine functions, composition is 
not quite like the others, requiring some special care with domains and codomains. 
If f: A— Bandg: B — C are functions, then we can compose f and g to 
form the new function 


gof:A—>C, withrule go f(a) = g(f(a)). 


Recall, especially, that order matters: The notation g o f means that g follows f. 

The two compositions g o f and f og are seldom equal—even if both make good 
7 

sense. 


EXAMPLE 5. Which compositions make sense for the functions 


MONTHNUMBER: A —> Nia, 
WORDLENGTH: A > Nj, and 
BIRTHMONTH: P — A 


from Example 2? 


+A minor technical point: Above, to avoid extra notation, we used the same symbol, B, both for 
the codomain of f and for the domain of g. In fact, these sets need not be identical. What really 
matters is that the composition rule g o f(a) = g(f(a)) make good sense. This occurs as long as the 
range of f is contained in the domain of g. As we have seen, a function’s codomain is often open to 
some choice, so this subtlety seldom causes difficulty. 
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SOLUTION. The compositions 
WORDLENGTH o BIRTHMONTH : P — N32 


and 
MONTHNUMBER o BIRTHMONTH : P — Nj 


make sense (the former is the function we called 7 in Example 2). For instance, 
WORDLENGTH o BIRTHMONTH (Carol) = WORDLENGTH (December) = 8 


and 


MONTHNUMBEROBIRTHMONTH (Carol) = MONTHNUMBER (December) = 12. 


Another possibility, SQUARE o MONTHNUMBER : A — Np, fails right out 
of the box, because SQUARE : Nig — Nj isn’t a function at all. But there is an 
easy fix: If we use the almost-identical function 


SQUARE:N—N, withrule SQUARE(n) =n’, 


then the composition SQUARE o MONTHNUMBER : A — N makes good sense, 
and (say) 


SQUARE 0 MONTHNUMBER(March) = SQUARE(5) = 25. 
Many other compositions, like 
BIRTHMONTH © BIRTHMONTH and BIRTHMONTH co WORDLENGTH, 
make no sense. ©) 


Properties of composite functions. If both f and g are, say, injective, must go f 
be injective too? We answer several such questions in the following theorem, 
leaving some proofs to the exercises. 


Theorem 1.10. Let f: A— Bandg: B = C be functions. 
(i) If both f and g are one-to-one, then so is go f. 

(ii) If both f and g are onto, then so is go f. 

(iii) If both f and g are bijective, then so is go f. 

(iv) If g° f is one-to-one, then so is f. 


(v) If go f is onto, then so is g. 
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We explained why in 
Example 2. 
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Notes on proofs. We’ll prove (ii) and (iv), leaving (i) and (v) to the exercises. 
Statement (iii) just combines (i) and (ii), so there is nothing new to prove. 

To prove (ii) we need to show that for any c € C there is some a € A with 
go f(a) = c. We can do this directly. For given c € C,, we know (because g is 
onto) there exists b € B with g(b) = c. Because f is onto there exists a € A with 
f(a) = b, and this a does the job: g o f(a) = g( f(a) ) = g(b) = c as desired. 

To prove (iv) we’ll show that if f is not one-to-one, then g o f cannot be one- 
to-one either. (In math-speak, this is an indirect proof; more details on such things 
in later sections.) Suppose, then, that a; 4 ag, but f(a1) = f(a2). Then we’d 
have g( f (a1) ) = g( f (a2) ), which is just another way of saying that go f(a1) = 
g 0° f(az). Thus g o f is not one-to-one, and the proof is done. 

Note, finally, that g need not be one-to-one just because g o f is. The calculus 
formula y = (e” + 1)? illustrates this. (Let f(a) = e” + 1 and g(x) = x; further 
details are in exercises.) 


Inverse Functions 


Another important definition: 


Definition 1.11 (Inverse functions). Let f : A > Bandg: B — A be func- 
tions. We say f and g are inverse functions if both 


f(g(6)) =6 forallbe€ B and g(f(a))=a forallac A. 
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In this case we can write f = g~' org= f7!. 


Recall, first, that “inverse” has several meanings in mathematics. For instance, 
the numbers 3 and —3 are additive inverses because 3+(—3) = 0. Similarly, 3 and 
1/3 are multiplicative inverses because 3 x 1/3 = 1, the multiplicative identity. In 
a similar spirit, two functions are inverses if composing them (rather than adding 
or multiplying) produces identity functions: 


fog(b) =b forallbe B; gof(a)=a foralacA 


The function MONTHNUMBER : A — Nj of Example 2 has a natural inverse 
function—let’s call it NUMBERMONTH : Nj — A, with values as follows: 


MONTHNUMBER (January) = 1; NUMBERMONTH(1) = January; 
MONTHNUMBER (December) = 12; _NUMBERMONTH(12) = December. 


By contrast, WORDLENGTH : A — Nj, which maps a month to its number of 
letters, has no inverse function LENGTHWORD : Nj2 — A. One problem is that 


WORDLENGTH (June) = 4 = WORDLENGTH (July), 
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so there’s no clear choice for LENGTHWORD(4). Another problem is that no 
month has one letter, so there’s no sensible value for LENGTHWORD(1). 

These toy examples suggest an important general fact: a function f : A — B 
has an inverse g : B — A if—but only if—f is both one-to-one and onto. In this 
case, f and g simply “reverse” each other’s work: if f maps a to 6, then g maps b 
to a, and vice versa. 


Relations 


Let A be a nonempty set, and f : A — A any function on A. We can think of f 
as its graph Gr = {(a, f(a)) | a € A}. Note that Gy is a subset of A x A—buta 
subset with special properties that reflect the fact that f is a function. For instance, 
G'y cannot contain two different elements of the form (ao, a1) and (ao, a2). 


Definition 1.12 (Relation on a set). A relation R ona set A is any subset of A x 
A. If (x,y) € R we write x Ry, and say that «x is related to y. 


This may sound forbiddingly abstract, but familiar (if lightly disguised) examples 
are all around us. 


© Functions are relations: The graph of any function f : N — N is a subset 
of N x N, so f is also a relation, and we could write x f y instead of y = 


f (2). 


° Extreme examples: The extreme (but uninteresting) cases are R = N x N 
and R = Q. In the first case all pairs of integers are related; in the latter, no 
integer has any relatives, even itself. 


© Equality: The set 
EQUALS = {(n,n) | n € N} 


corresponds to the equality relation: each integer is related (i.e., equal) only 
to itself. In this case, of course, we usually write n = n, not n EQUALS n. 


°® Order relations: The set 
LESSTHAN = {(m,n) | m <n} 


is an order relation: each integer is related to every integer larger than 
itself. Again we have a handy symbol, and we write 3 < 4 instead of the 
more cumbersome 3 LESSTHAN4. (Similar relations correspond to other 
ordering symbols: <, >, and >.) 


A function and its graph are 
essentially the same thing. 
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Divides: The set 
DIVIDES = {(m,n) | m divides n evenly} 


defines the “divides” relation, in which we find 3 DIVIDES 123456, 
42 DIVIDES 42, and 1 DIVIDES n for all n. Number theorists use the handy 
symbol m | n rather than m DIVIDES n. 


Equivalence relations. The definition of a relation on A—any subset of A x 
A—is very, very general. In practice, many useful relations have some special 
properties, with impressive names. An equivalence relation R on A is one that is 


(i) reflexive: a Ra for alla € A; 
(ii) symmetric: if a Rb, then b Ra; 
(iii) transitive: ifa Rbandb Rc, thena Re. 


Equality (on any set) is the prototype and simplest—but not the only—example 
of an equivalence relation. Consider, for example, the SAMEBLOODTYPE rela- 
tion on the set of humans, where x SAMEBLOODTYPE y means (of course) that 
x and y have the same blood type. It is easy to see that SAMEBLOODTYPE is 
indeed an equivalence relation, which sorts people into four “families” (called 
equivalence classes) based on their blood types: A, B, AB, or O. More examples 
are in the exercises and later in this book. 


Exercises 


1. Each part following gives the rule (implicit in the name) for a possible func- 
tion. For each rule, find a reasonable domain A and codomain B to create a 
function. (Try to make A relatively large and then make B relatively small.) 
Is your function injective? Is it surjective? 


(a) MOTHER: A > B 
(b) FIRSTBORNSON : A > B 
(c) EYECOLOR: A> B 


(d) BIRTHDAY : A— B 


2. Find the natural domain for each of the following calculus-style functions. 


(a) f(x) =sin(a# + e”) 
(b) g(2) 


tan(z) 
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(c) h(x) = Va? —-1 
(d) h(x) = Vx? + rx + 1 (give a decimal approximation) 
3. Imagine a function f : [0,1] — [0,1] and its graph. 


(a) How can you tell from its graph whether f is one-to-one? 
(b) How can you tell from its graph whether f is onto? 


(c) Find formulas for three different calculus-style functions f : [0,1] > 
(0, 1] that are one-to-one and onto, and such that f(0) = 1. 


(d) Define f : [0,1] — [0, 1] by setting f(x) = x for x € Qand f(x) = 
x? for x ¢ Q. Is f one-to-one? Onto? 


4. Consider the function WORDLENGTH : A — Np, discussed in Exam- 
ple 2, page 16, and let G be its graph. 


(a) Describe the set G. (Hint: It is a set of ordered pairs; for instance, 
(May, 3) € G.) 


(b) The WORDLENGTH function is not one-to-one. How does the graph 
G reveal this? 


(c) The WORDLENGTH function is also not onto. How does the graph G 
reveal this? 


5. Acertain function f has graph G = {(a,1), (e, 2), (i,3), (0,4), (u,5)}. 


(a) What are the domain and the (smallest possible) codomain of f? 
(b) The function f is bijective. How can we tell this from the graph? 


(c) Because f is bijective there is an inverse function, f—!. What is the 
graph of f—!? 


(d) Can you think of a more descriptive names than f and f~! for these 
functions? 


6. Acertain function f has graph 


G = { (Montana, 3), (Maine, 4), (Maryland, 10), (Massachusetts, 12), 
(Michigan, 17), (Minnesota, 10), (Mississippi, 6), (Missouri, 11)}. 


(a) What is the domain of f? What is the range? What is a reasonable 
codomain? 


(b) Is f injective? How do you know? 
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(c) Is f surjective? (The answer depends on your choice of codomain.) 


(d) Can you think of a more descriptive name than f for this function? 
7. Consider the linear function L : R > R given by L(x) = 3a 4 2. 


(a) Explain why L is both one-to-one and onto. 


(b) Let M: R — R be given by M(x) = =. Show that L o M(x) = 
Mo L(a) = xforallx ER. 


8. Consider the linear function L : R — R given by L(x) = ax + b, where a 
and 6 are any real numbers, with a F 0. 


(a) Explain why L is both one-to-one and onto. 


(b) Find a formula for an inverse function M = L~!. Check your answer 
by showing that Lo M(x) = Mo L(x) = x forallz ER. 


9. (a) Consider the function f : R — R given by f(x) = 2”. For which 
positive integers n is f one-to-one and onto? 
(b) Consider the function f : (0,00) — (0,00) given by f(x) = x”. For 
which positive integers n is f one-to-one? What if n is negative? 


10. Let f: A— Bandg: B — C be functions. 


(a) Show that if both f and g are one-to-one, then so is g o f. (This is (i) 
of Theorem 1.10, page 21.) 

(b) Show that if g o f is onto, then so is g. (This is (v) of Theorem 1.10, 
page 21.) 

(c) Consider the functions f(x) = e” + 1 and g(x) = 2?, both with 
domain R and codomain R. Show that g o f is one-to-one, but g is 
not. 


11. Consider the functions f(x) = 2? and g(x) = \/z, both with [0, 00) as 
domain and as codomain. 


(a) Explain why f and g are inverse functions. 


(b) Consider the function h : R — [0, 00) with rule h(x) = x? (formed 
from f by tinkering with the domain). Explain why /h and g are not 
inverse functions. 


12. The ordinary sine and arcsine functions are inverses as long as some care is 
taken with domains and codomains. Work out the details—that is, specify 
domains and codomains for these functions that make them inverses. 
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13. Explain why the SAMEBLOODTYPE relation, discussed at the end of this 
section, is reflexive, symmetric, and transitive. 


14. Consider the relation MODS on Z, defined by MoD5 = {(m, n) | 5 divides 
m—n}. 


(a) Show that MODS is an equivalence relation. 


(b) Find the set, denoted by [0], of all integers that are related (by Mod5, 
of course) to 0. Do the same for [1], [2], [3], and [4]. (These are called 
the equivalence classes for MODS.) 


15. Let P be the set of (around 400,000) citizens of Minneapolis, Minnesota. 
One possible relation on P is BROTHER, defined by BROTHER = {(z, y) | 
y is x’s brother}. Notice that BROTHER is transitive, but neither reflexive 
(nobody is his own brother) nor symmetric (think about sisters). Notice 
too that BROTHER is not a function, because some Minneapolitans have no 
Minneapolitan brothers, while others have more than one. 


In each part following, decide whether the relation is (i) reflexive; (11) sym- 
metric; (iii) transitive; (iv) a function. Explain answers briefly. 


(a) SAMELASTNAME 

(b) ANDERSON = {(xz, y) | x and y are Andersons} 
(c) SIBLING 

(d) MAYOR = {(z, y) | y is z’s mayor} 

(ec) OLDESTNEIGHBOR 


1.4 Proofs 101: Proofs and Proof-Writing 


Without proofs, mathematics would not exist. Proof is as fundamental to working 
mathematicians as structured experiments are to chemists or biologists. 

This is not to say that mathematics is nothing but proof. Calculations, hunches, 
experiments, and searches for patterns are all indispensable in mathematics, as 
they are in the sciences. But mathematical certainty, obtained through rigorous 
proof, has no clear analogue in the other sciences, where theories are always open 
to revision or falsification through experiments. The Pythagorean theorem, by 
contrast, has been known to be true for at least 2500 years. 

A first course in real analysis is a natural setting for learning about, and for 
practicing, mathematical proof. Many concepts and objects of real analysis— 
sequences, functions, derivatives, etc.—are familiar, if perhaps only informally, 
from calculus courses. The main theorems are mathematically powerful and in- 
tuitively accessible. Finding and writing proofs takes care and practice, but the 
reward is proportional to the effort. 
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See also the coda at the end of 
this section. 


Pythagoras recognized his 
proof as a big deal. He 
celebrated, legend has it, by 
sacrificing 100 oxen. 
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Or will be, once we have 
defined all the ingredients. 


There are some exceptions, but 
not many. 
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Mathematical Language and Literature 


The beginning of wisdom is to call things by their right names. 


The old Chinese sage who supposedly said so may have had other things in mind, 
but the advice certainly applies to mathematics. Without clear and unambiguous 
language—“right names”—we can’t know exactly what we are talking about, and 
therefore we can’t produce really convincing proofs. 

The moral is that care with language is just as important in mathematics as it 
is in, say, the literary arts. Granted, poets and mathematicians use language very 
differently: good poetry may rely on subtle allusions and shaded meanings, but 
the best mathematical proofs are always clear, direct, and straightforward, even 
when they convey difficult ideas. Both poems and proofs can be praised as ele- 
gant, but the judgment depends on different standards. The good news is that we 
mathematicians need not aspire to fancy artistry: clarity and directness of expres- 
sion are less rarefied arts than practical skills, readily acquired and improved on 
the job. 

Following are some samples, phrased as advice, of characteristics of mathe- 
matical writing. 


Use standard symbols and notations. Using standard notations, of which we’ll 
encounter many in this book, helps shorten, unclutter, and clarify mathematical 
discussion—but only if notations are used consistently, and with care. For in- 
stance, the notations (1,3), [1,3], and {1, 3} all have precise but different mean- 
ings—one is an open interval, one is a closed interval, and one is a set with just 
two members. Straying from these conventions is asking for trouble. It is far from 
clear, for instance, what such notations as 


[x |a2?>2] and Q|(—v2, v2) 
really mean. By contrast, the expressions 
{x |x? >2} and {rE Q| a? <2} 
are clear and unambiguous. 


Define everything. Everyone expects theorems, proofs, and calculations in math- 
ematical writing. Less expected, but equally important, are formal definitions. 
Words in everyday language have fluid meanings, with nuances that depend on 
context. By contrast, words in mathematics have fixed, precise, agreed-upon 
meanings. A typical theorem claims that some mathematical object has some 
mathematical property. Without clear definitions of the objects and properties 
under discussion, proof can’t get started. 
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EXAMPLE 1. Prove this claim: The set of rational numbers is closed under 
addition. 


DISCUSSION. The mathematical object in question is the set of rational num- 
bers. The claimed property, closure under addition, means that the sum of any 
two rational numbers is another rational number. 

It is easy enough to check that the claim holds for any particular pair of ratio- 
nal numbers. For the rationals 3/5 and 2/7, for instance, we use a common de- 
nominator to calculate the sum, 31/35, and observe that it is rational, as claimed. 
A general proof, however, requires a general definition, and to this end we define 
a rational number as any ratio of integers a/b, where b 4 0. While we’re at it, 
let’s also sharpen our description of the closure property: a set S is closed under 
addition if r + s is amember of S whenever r and s are members of S. Now we 
can restate the claim for our convenience: 


If x and y are rational numbers, then so is x + y. 


With clear definitions at hand we’re ready for a concise proof. 


PROOF. Since x and y are rational, we can write x = a/b and y = c/d, where 
a, b, c, and d are integers, and b and d are nonzero. Now we use some fraction 
arithmetic: 


* a Ges Ue Geis 
edb a bd bd hd 
a rational number, and the proof is done. ©) 


Fair game? Few proofs are completely from scratch. Even the simple proof 
above relies on some very basic properties of integers: sums and products of in- 
tegers are always integers, and the product of nonzero integers is always nonzero. 
(In this book we will freely assume and use such basic properties of integer arith- 
metic.) Knowing just which assumptions are safe and which need proof can be 
tricky—especially when assumptions are “understood” rather than stated explic- 
itly. Learning to sort out such matters is part of the craft of proof. 


Write in complete “sentences.” The quotes are there because mathematical sen- 
tences in mathematics may include not just ordinary words but also symbols, 
equations, inequalities, etc. For instance, it is fine to write 


because x > 3 we know 2? > 9 and x? > 27, 
but it’s bad form to write 


xz > 3,27 >9,2° > 27. 
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See the common denominator? 
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We will study the Archimedean 
principle in Section 1.6. 
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One problem with the latter is the lack of “connective tissue”: a reader can’t tell 
whether you’re asserting that something implies something else, or just listing 
your favorite inequalities. At any cost, be clear. 


Use standard grammatical niceties. These devices—such as punctuation and 
capital letters—are used in ordinary English to help the reader see clearly what’s 
being said or implied. Consider, for instance, a notice supposedly posted near an 
Australian beach: 


Crocodiles don’t swim here 


Would you swim here or not? Is the sign intended for human or for reptile readers? 

In practice, many mathematical sentences convey complex ideas, and so nat- 
urally have correspondingly complex structures. It is especially important, there- 
fore, to write mathematics as clearly and unambiguously as possible, and to help 
the reader decipher your meaning. 


Write sentences that “scan.” Proofs and solutions must be not only correct but 
also intelligible to a reader. An excellent way to assure the latter is to read each 
sentence back to yourself. (Doing this silently may reduce ridicule from neigh- 
bors.) A sentence with proper English grammar and syntax may be mathemat- 
ically right or wrong; every mathematician has seen eloquent proofs that boil 
down to nonsense. But an ungrammatical sentence is almost surely wrong or, 
worse, meaningless to a reader. 


Make sense. Ask carefully whether what you write makes sense by the strict 
standards of mathematical writing. For instance, the sentence 


For all Q, we have x? > 0. 


is nonsensical (do you see why?) and hence neither true nor false. Your first job 
is to write sentences that make sense. Your second job is to write sentences that 
are true. 


Watch word order. Because mathematical language conveys complex thoughts, 
word order can be crucial. Consider these two statements: 


For every positive € there exists an integer n such that 0 <1/n <e. 


There exists an integer n such that 0 < 1/n < «€ for every positive €. 


The first statement is true; it is a version of so-called Archimedean principle for 
real numbers. The second statement is meaningful, but false. 
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Just don’t do “it”, The harmless-looking pronoun “it” commits countless crimes 
in mathematics. Here, for example, is a confusing way to describe an important 
connection between a function f and its derivative f’: 


It’s maximum or minimum when it’s flat, and that happens when it’s 
a zero of its derivative. 


That’s way too many pronouns, and who knows what each refers to? Just say no: 


A function f may have a maximum or minimum value where the 
graph of f is flat. This can occur at a value of x for which f’ (x) = 0. 


Make it look easy. A musician planning a recital invests hours of practice and 
study, and hits plenty of false notes. The recital itself skips all of this practice 
and study, and most of the false notes. In the same way, a finished mathematical 
proof should be the polished result, rather than the basic process, of whatever 
informal thinking, experiments and false starts may have happened along the way. 
It is sometimes helpful to hint at the investigative phase of proving a result, but it 
is important not to confuse such material with the proof itself. Good proofs are 
clear, concise, and couched in standard mathematical language. 


Don’t say too much—or too little. Respect, but don’t overtax, your reader’s intel- 
ligence and willingness to work. Ideas in your proof should be clear and accessi- 
ble to your reader—someone with your own level of intelligence and knowledge. 


1.4.1 Proof Language: A Lexicon 


Mathematics, like every discipline, has its own body of technical language. Spe- 
cialized vocabulary (“jargon” to its detractors) is sometimes impugned as will- 
fully obscure, but exactly the opposite is true: well-chosen technical language is 
essential to clear and concise discussion of technical subjects. To get these bene- 
fits, of course, we’ll need to know and agree on what technical words mean. The 
following little lexicon of important “proof words” begins that project. A general 
warning: Some technical words, such as “statement,” have everyday meanings 
somewhat different from their mathematical uses. 


Statement. A mathematical statement is a phrase or sentence that can reasonably 
be called either true or false. Here are some possibilities: 


P:3>65 Q : It is cloudy. R: Itis raining. S : Life is good. 


Statements P, Q, and R are all good examples. Statement P is obviously false, 
but mathematically meaningful. Statements Q and R are also fine because, at a 
given time and place, each is either clearly true or clearly false. Statement S' is 
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We could, but won't, argue over 
whether this is for better or for 
worse. 
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Things might change if we 
allowed, say, imaginary number 
inputs. 


Solving equations, as in linear 
algebra, is all about searching 
for equivalent, but simpler, 
equations. 


1. Preliminaries: Numbers, Sets, Proofs, and Bounds 


too vague to make our cut, and we’ll leave the fun of debating it to colleagues 
across campus. 

Complex statements can be built up from simpler ones. Using ingredients 
above, for instance, we can create blends like these: 


if R then Q : If it is raining, then it is cloudy. 


and 
Q and not R : It is cloudy but not raining. 


Variables. Some statements involve variables: symbols that stand in for unspec- 
ified inputs. For such statements we sometimes use names like P(a) to emphasize 
the presence of variables. Here are some examples: 


P(e) a? 23 Q(y) : y is Fred’s brother. R(z,y): 2° +y? > -3 


The truth or falsity of such statements usually depends on the values of the vari- 
ables involved. Here, for instance, P(7) is false, P(—1) is true, Q(Sue) is proba- 
bly false, and Q(Ed) might be true. In this case R(x, y) happens to be true for all 
real number inputs x and y. 


Implication and equivalence. An implication is a statement of the form “if P 
then Q”; we often write P ==» Q. This means, of course, that @ is true 
whenever P is. With P : 2 > 3 and Q : x? > 9, for example, 


P—Q means c>3 => 2?>9 


and 
Q=— P means x? >9 = 2>3. 


Here, clearly, the implication P ==> Q is true: x? > 9 does indeed hold 
whenever x > 3. But the implication @ == P is false; try x = —42. An 
important moral is that implication is, by default, a one-way street: P => Q is 
no guarantee thatQ => P. 

If it happens that both P => QandQ ==> PP, then P and Q always 
have the same truth values, and are therefore called equivalent. The statements 
P(x) : 24 +5 = 11 and Q(a) : x = 3 are equivalent, for instance. 


And, or. Given statements P and Q, we can form new statements ( P and Q ) 
and (Por@). (The parentheses aren’t essential, but they help keep the right 
things together.) “And” is used mathematically much as it is in everyday speech: 
( P and Q) is true if, but only if, both P and Q are true. “Or” is a little different: 
We take ( P or Q ) to be true when either or both of P and Q is true. This conven- 
tion, called the inclusive or, differs a bit from the exclusive or, sometimes written 
xor, that’s common in everyday life: A child might be offered either candy or ice 
cream, but not both. 


1.4. Proofs 101: Proofs and Proof-Writing 


Negation. The negation of a statement P is a new statement, which we’ll denote 
not P, 


whose truth value is opposite to that of P. Negating simple statements is easy. 
With 
R: Itis raining, 


for instance, we have simply 


not # : It is not raining. 


Negating complicated statements can take some thought. Consider, for instance, 
Goldbach’s conjecture: 


G : Every even integer greater than 2 is the sum of two primes. 
How would we negate G’? We could, admittedly, write something like 
It is not the case that G, 


but this is unhelpful. A better approach is to notice that G asserts that every even 
integer has a certain property. To negate such a claim is to say that some even in- 
teger does not have this property: 


not G: Some even integer greater than 2 is not the sum of two primes. 


In other words, to disprove G it is enough to find even one big even number that 
is not the sum of two primes. 

In a similar spirit, De Morgan’s laws tell how to negate statements that involve 
and and or: 


not (PandQ@) isequivalentto (not P) or (notQ); 
not(PorQ) isequivalentto (not P) and (notQ). 


Notice, especially, that negating and statements produces or statements, and vice 
versa. 


The meaning of existence. Consider two typical, analysis-style statements about 
real numbers: 


(i) Forall x > 3,27 +9 > 62. 
(ii) sina = 2/2 for some z € [1, 2]. 


Statement (i) says that a certain property holds for all x € (3,00), while (ii) says 
that a number with a certain property exists in the set [1, 2]. The for all and there 
exists constructions are everywhere in mathematics, and have their own symbols: 
V and J, respectively. Using these symbols we could rewrite 
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This famous unsolved problem 
dates back to the 1740s, in 
correspondence between 
Christian Goldbach and 
Leonhard Euler. 


And achieve fame. 
It's been tried .... 


Named for a nineteenth-century 
British mathematician. 


They happen to be true. 


34 
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Which we dare not confuse .... 


Common sense bears this out; 
we explore it further in the 
exercises. 
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(i) Vx € (3,00), we have x? + 9 > 62; 


(ii) Sa € [1, 2] such that sin x = «7/2; 
or even just 


(i) Vx € (3,00) (27+9> 6x); 


(ii) de € [1,2] (sine = 2/2); 


if brevity is really important. 

The symbol V is known as the universal quantifier; Vz P(x) means that the 
property P(x) holds for every x in some “universe” of discussion. Similarly, 5 is 
the existential quantifier; Iz P(x) means that P(x) holds for at least one x. 

The V and 4 symbols arise frequently in analysis, often together. For instance, 


the statement 


Ve € Ran €N such thatxz <n 


says—truthfully—that every real number xz is smaller than some integer. The 
superficially similar-looking statement 


«2 € RsuchthatVneNa<n 


says something different—and false. 


Converse and contrapositive. For some reason, C-words are disproportionately 
common among crucial components of concise corroborations. Can this be just 
curious coincidence? 

An implication P = > Q has both a converse and a contrapositive. The 
converse of P => Q is the implication@ ==> P; the contrapositive is the 
implication (notQ) => (not P). If, say, P: 2 > 3andQ: x? > 9, then we 
have 


P=>Q: #«>3=—>27>9 (the original); 
Q=> P: «#>9=—> 2£>3 (the converse); 
(notQ) = > (notP): 2? <9 = 2x<3 (the contrapositive). 


As noted earlier, an implication and its converse need not have the same truth 
value—and they do not in the present case. An implication and its contrapositive, 
on the other hand, always have the same truth value. This matters in mathematical 
practice, as we will see, because the contrapositive of a statement is sometimes 
easier to prove than the original. 


1.4. Proofs 101: Proofs and Proof-Writing 


Counterexample. For which integer powers p is 2? — 1 prime? Back-of-the- 
envelope calculations show that this happens if p is 2, 3, 5, or 7; and so, running 
out of envelope, we might guess that 2? — 1 is prime for all prime powers p. Not 
so, alas: 2''—1 = 2047, which is not prime. In proof talk, 11 is a counterexample 
to our hasty claim. 

A counterexample is just what the name implies: a single example that coun- 
ters an incorrect general claim. When, as above, the claim asserts that some 
property P(x) holds for all x in a given set, a counterexample is simply some 
x for which P(x) fails. Mathematicians love counterexamples, perhaps for the 
pleasure they can offer in toppling grand theories with humble facts. 


Coda: new-age proofs. Proofs are no less important now than they’ve ever been. 
But modern viewpoints and, especially, modern technology have made new kinds 
of mathematics possible—and sometimes require new kinds of proof. For exam- 
ple, the four-color theorem (four colors suffice to color any planar map of “coun- 
tries”) was posed in 1852, but proved only in 1976, with aid from a computer to 
check hundreds of special cases. Computer-aided proofs are now common, but 
they remain controversial. 


Exercises 


1. This problem is about three statements: 
R: It is raining. S : Itis sunny. C : It is cloudy. 


In each part following, rewrite the given implication in ordinary language. 
Then do the same for the converse and the contrapositive. Label all state- 
ments as true or false. 


(a) R = nots 

(b) R= C 

(c) R => (not S) andC 
(d) C => nots 


2. This problem is about three statements: 
P:2>3; Q:a<A4; R:9<a2? <16. 
In each part following, rewrite the given implication as simply as possi- 


ble. Then do the same for the converse and the contrapositive. Label all 
statements as true or false; no proofs needed. 
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a) P= R 
b)Q => R 
(c) (P and Q) = R 


3. In each part, write (as simply as possible) both the converse and the contra- 
positive of the given implication. No proofs needed, but try to label each 
statement as true or false. In all parts, a, b, ay, etc. all stand for real num- 


bers. 


(a) If a and 6 are both rational, then a + 0 is rational. 


(b) If a is irrational then 1/a is irrational, too. 


(c) If a and 0 are both irrational, then abd is irrational. 


(d) Ifa series 5* ap, converges, then limps. Gn = 0. 


4. In each part, write (as simply as possible) the negation of the given state- 
ment. 


(a) At least one of a, b, and c is nonnegative. 
(b) f(x) < 3 forall x € [2,7]. 


(c) 
(d) 


sin n is irrational for every positive integer n. 


Ja € R such that 2? = —1. 


(e) Ve €R Jy € Rsuch that cy = 1. 


5. Negations. For each statement following, write the negation as simply as 
possible. Then try to decide (some parts may be hard) whether each state- 
ment is true or false; no proofs needed. 


: cos(x) 
: cos(x) < 0 for all x € [2,3]. 


: Every even integer n > 2 is the sum of two (not necessarily distinct) 


SSN GCHunwOY 


: All dogs have fleas. 
: Several dogs have fleas. 


: My dog, Spike, is flea-free. 


V¥n—1++</n-+ 1is irrational for every positive integer n. 


VY¥n—1+</n-+ Lis rational for every positive integer n. 
< 1.001 for all x > 0. 


primes. 


: Every set of ten distinct numbers has a largest and a smallest element. 


: Every subset of the interval [0, 1] contains a largest element. 


1.5. Types of Proof 


6. Counterexamples. At least three of the statements P—Z in the preceding 
problem are false. Choose any three of these false statements. For each 
one, carefully state a counterexample that shows the statement to be false. 


7. Converses and contrapositives. In each part following is an implication: a 
statement of the form “If P, then Q”. In each part, first write the converse 
and the contrapositive of the given implication. Then label each of the three 
statements as true or false; no need to prove anything. 


(a) If x > 3 then x? > 9. 


(b) If x > 3 then x? — 42? + 32 > 0. 
(c) Ifa > Oand b > O then Ja + 6b] = Ja] + |d]. 


8. What is the contrapositive of the contrapositive of an implication? Explain. 


9. Following are several possibly true but poorly-stated claims from elemen- 
tary calculus. Fix each statement by replacing all instances of “it” and 
“its” and “it’s” with clearer words or phrases. (An example appears on 


page 31.) 


(a) If it’s increasing for all x then its derivative is nonnegative. 
(b) At its maximum either its derivative is zero or it does not exist. 
(c) If its second derivative is positive for all x then it’s concave up. 


(d) If a series converges then it goes to zero. 


10. Each of the following sentences has one or more syntax errors. In each 
case, make a clear (and true) sentence with as little editing as possible. 


(a) If x? 4 Q, then x F Q either. 


(b) /2is not Q but 2’ is Q. 
(c) VNER. 
(d) sin (R) C [1,1]. 


1.5 Types of Proof 


Mathematical assertions come in enormous variety. Here is a tiny sample, includ- 
ing at least one that is false: 


I: If a and b are rational numbers, then so is ab. 


Il: If a and 0 are irrational, then so is a + b. 
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integer basics. 
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Ill: If a is irrational, then so is 1/a. 
IV: /5 is irrational. 
V: There are infinitely many prime numbers. 
VI: Every even integer from 4 to 100 is the sum of two primes. 
VII: Every even integer greater than 2 is the sum of two primes. 
VII: A finite set with n elements has 2” subsets. 


Proofs (and disproofs, when needed) come in similar variety; no simple taxonomy 
is possible. Still, a few standard types of argument are so common that they appear 
in every mathematician’s toolkit. We name and illustrate several of them in this 
section. 


Hypotheses and conclusions. A typical mathematical assertion (theorem, claim, 
proposition, lemma, corollary, etc.) starts with one or more hypotheses and derives 
a conclusion. This structure is often—but not always—explicit in the language it- 
self. All of I-III above, for instance, have the general form of an implication: if 
[hypothesis], then [conclusion]. 

For other assertions, including [V-VHI above, the hypotheses are implicit or 
understood rather than stated explicitly. If, for some reason, we want to state 
hypotheses more explicitly we could rewrite Claim IV, for instance, as 


IV’: If x? = 2, then z is irrational. 


Claim V is harder to restate in if-then form because its hypotheses, such as the 
definition of a prime number, would be tedious to list. 

The form of a claim often suggests possible strategies for proving or refuting 
it. In the case of an if P then Q claim, for instance, we might attack either the 
claim itself or its contrapositive, if not Q then not P. For broad claims like VII 
and VIII, each of which covers infinitely many cases, we should expect to work 
harder for a proof—or maybe look for even one counterexample as a disproof. 


Direct Proof 
A direct proof addresses a claim if P then Q in the “obvious” way—it starts with 


P and derives Q. We illustrate with Claim I. 


EXAMPLE 1. Prove directly (using basic properties of integer arithmetic) that if 
a and b are rational numbers, then ab is rational. 


1.5. Types of Proof 


SOLUTION. Because a and } are rational, we can write 


a=— and b= 


z 
me 
W 


where x, y, z, w are all integers, and y 4 0 4 w. Now we multiply: 
ia fa) 
y wWwyw 
which exhibits ab as the desired ratio of integers, and completes the proof. 
Note, by the way, that with a few more words we could also establish that 
ab = ba. (We did something similar with sums in Example 2, page 4.) ©) 


Indirect Proof 


An indirect proof, like a direct one, addresses a claim of the form if P, then Q. But 
there is a twist: instead of showing that P implies Q, we prove the contrapositive 
(but equivalent!) statement: 


if (not Q) then (not P). 


An indirect proof may be the simplest choice when either P or Q is awkward, but 
(not P) or (not Q) is simpler. We illustrate with Claim III. 


EXAMPLE 2. Prove indirectly: if a is irrational, then 1/a is irrational too. 


SOLUTION. A direct argument seems awkward here because the hypothesis says 
something negative about a. The contrapositive is simple and straightforward: if 
1/a is rational, then a is rational. 

This is easily shown. If 1/a is rational, then we can write 1/a = 2/y for 
some (nonzero) integers x and y. But then we have a = y/x, which shows that a 
is rational, as desired. ©) 


EXAMPLE 3. Discuss Claim II: If a and b are irrational, then so is a + b. 


SOLUTION. The direct approach looks unpromising, as it did in Example 2. An 
indirect proof might seem the next resort. But there is a catch: Claim II is false, 
and any counterexample will demolish it. If, say, a = J/2 and b = —1/2, then 
both a and bare irrational, but a+b = 0, which is rational, so our counterexample 
is good. 

Our counterexample depends, of course, on the fact that /2 is irrational, as 
we showed in Theorem 1.2, page 6. Observe, however, that any irrational number 
would do in place of V2. ©) 
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Proof by Contradiction 


Proofs by contradiction are close kin to indirect proofs. In each case we first 
assume that the conclusion fails, and then try to deduce a contradiction, either of 
the hypothesis or of some other known fact. From the resulting absurdity we infer 
that the original conclusion must have been true all along. (The method is also 
known as reductio ad absurdum, Latin for “reduction to the absurd.”) 


EXAMPLE 4. Prove Claim IV: V5 is irrational. 


SOLUTION. We could tweak the proof of Theorem 1.2, page 6, but for variety 
we'll take an approach based on prime factorization: Every positive integer n 
has a unique list of prime factors, some of which may be repeated. (For n = 60 
the list is {2,2,3,5}.) The key insight for our proof is about squaring: each 
prime factor of n appears twice as often among the prime factors of n?. (For 
n? = 602, the prime factors are {2, 2, 2,2,3,3,5,5}.) In particular, every square 
integer has an even number of prime factors. So much said, we’re ready for a crisp 
proof. 


PRoor. Assume, toward contradiction, that \5 is rational. Then we can write 
V5 = a/b for some positive integers a and b, and so 


J/5 ; 5 a Bb? = a2. 


The last equation provides our contradiction. The right side is a square, and there- 
fore has an even number of prime factors. But the left side has an odd number of 
prime factors—an even number coming from b? and one more from the 5. This 
absurdity completes the proof. © 


EXAMPLE 5. Prove Claim V: There are infinitely many prime numbers. 


SOLUTION. The following beautiful proof is attributed to Euclid, a Greek math- 
ematician who worked at Alexandria, Egypt, around 300 BCE. Euclid’s idea can 
be phrased as a proof by contradiction. 


PROoF. Assume, toward contradiction, that there are only finitely many primes, 
Say P1, 22, P3,---;Pn- Now consider the number N = pi p2p3--- Pn + 1. By its 
construction, N is not divisible by any of the primes p1, p2,p3, ..-, Pn. Hence 
either N is itself prime or N has at least one prime factor not among pj, p2, 
13, +--+; Pn. Either way, the list {p1, po, p3,-..-,Pn} could not have been com- 
plete. This contradiction completes the proof. © 


1.5. Types of Proof 


Brute Force, Cases, Exhaustion, Enumeration 


Satisfying as it is to dispatch broad claims with elegant proofs like Euclid’s, some 
claims resist such treatment. Sometimes we just argue case-by-case, through 
several—or even many—different parts of a general assertion. Such “brute force” 
or “exhaustive” methods can be ugly, but sometimes there is no alternative. And 
working through special cases, especially with a computer’s help, can reveal im- 
portant patterns. 


EXAMPLE 6. Prove Claim VI: Every even integer from 4 to 100 is the sum of 
two primes. 


SOLUTION. Why not just prove Claim VII—every even integer greater than 
two is the sum of two primes—and be done with it? That would indeed do 
the job, but Claim VII is Goldbach’s conjecture, a famous problem dating to 
the 1740s. Considering that Claim VII has already stumped the likes of Euler, 
we'll stick with Claim VI. Handling its 49 special cases is easy, although perhaps 
tedious: 


4=242; 6=34+3; 8=34+5; ... 98=114+87; 100=7493. 
0) 


Proof by Induction 


Mathematical induction is among every mathematician’s favorite power tools. It 
is a simple, structured, and sometimes astonishingly powerful approach to prov- 
ing whole families of claims at once. We illustrate the idea first informally, by 
example. 


EXAMPLE 7. Discuss Claim VIII: A set with n elements has 2” subsets. 


SOLUTION. Claim VIII is really an infinite collection of propositions P(1), 
P(2), P(3), ..., one for each positive integer n. To emphasize this structure, 
we can list the propositions individually (we may as well use {1,2,...,n} asa 
convenient n-member set): 


P(1): {1} has 2 subsets, 
P(2): {1,2} has 4 subsets, 


P(42): {1,2,...,42} has 2” subsets. 
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alternative, but it’s hard to find. 


Try some more! 
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Every set has @ as a subset. 


See the pattern? 
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It is easy in this case to check P(n) for small n, simply by listing. For P(1), the 
two subsets are {1} and @. For n = 2 the list is 


0, {1}, {2}, {1,2}. 


With n = 3 we get eight subsets, as expected: four from the n = 2 case plus four 
more that include a 3: 


0, ae {2}, {1,2}; {3}, {1,3}, {2,3}, {1,2,3}. 
The basic ingredients of a proof should now be clear: 
(i) a set with one element “obviously” has two subsets; and 


(ii) adding a new element to any finite set doubles the total number of subsets 
(every “old” subset generates a new one when we throw in the new element). 


Hence sets with 1, 2, 3, 4,... elements have 2, 4, 8, 16,... subsets, as desired. 


% 


Induction, formally. Example 7 illustrates the usual setting for a proof by induc- 
tion: a claim that some proposition P(n) holds for every positive integer n. To 
prove such a claim by induction takes two (named) steps: 


° The base case: Show that P(1) holds. 


© The inductive step: Show that P(k) implies P(k& + 1) for every positive 
integer k. (Here P(k) is called the inductive hypothesis.) 


We illustrate the idea (and the customary shop talk) by formalizing the proof 
that, for all n, the set {1,2,...,42} has 2” subsets. 


Proof (of Claim VIII, by induction): The base case n = 1 holds because the set 
{1} has only itself and the empty set as subsets. 

For the inductive step we assume the inductive hypothesis—that {1,2,...,k} 
has 2" subsets—and try to show that {1,2,...,k,k + 1} has 2**1 subsets. To 
this end, note first that all 2” subsets of {1,2,..., k} are also subsets of {1,2,..., 
k +1}. Every remaining subset of {1,2,...,4+1} contains k+ 1, and so can be 
formed by adding k + 1 to some subset of {1,2,...,&}. Thus {1,2,...,k,k+1} 
has 2 x 2* = 2*+1 elements, as desired, and the proof is complete. 


After one more simple example we’ll draw some conclusions. 


EXAMPLE 8. The identity 1+ 2+3+---+n= 24 holds for all positive 


integers n. 


1.5. Types of Proof 


SOLUTION. We'll write P(n) for the identity above; for instance, P(10) says 
that 1 +2+---+10 = 10-11/2 = 55. This is easily checked directly, but 
we’d rather prove P(n) for all integers n—just the right job for mathematical 
induction. 


PROOF BY INDUCTION. The base case, P(1), holds by a trivial calculation— 
both sides of the equation are 1. 

For the inductive step we assume equation P(k) and derive P(k + 1). A little 
algebra is all we need: 


1+2t--th+(h+1) =ATO yyy 
=(k+1) ($+1) = ruer?) 


That the first and last quantities are equal is just what P(k + 1) asserts. The proof 
is complete. © 


Notes on the method. Mathematical induction is so useful and so ubiquitous in 
mathematics that some general themes deserve mention. 


* Not cheating: To many conscientious newcomers the inductive step in a 
proof feels like cheating—assuming the desired result in order to prove it. 
Caution is always wise, but here we’re OK: this part of the proof shows 
only that P(k) implies P(k + 1), not that P(k) itself must hold. 


Dominoes, ladders, lily pads: Choose your favorite metaphor—different 
authors use different real-world images to illustrate mathematical induction. 
My favorite involves dominoes labeled P(1), P(2), P(3), etc., arranged in 
the classical childhood manner (if only the child had infinitely many), so 
that every P(n) falls to the right, toppling P(n + 1). The inductive idea is 
now a domino theory: if (i) the first domino falls; and (ii) each domino fells 
its right-hand neighbor, then a// the dominoes fall (to the child’s infinite 
delight). 


Ladders (getting to the first rung is the base step) and frogs crossing lily 
ponds, pad by pad, lead to similar stories—flesh out your own details. 


Other bases: Not every claim for which induction looks promising starts 
with the n = 1 case. For instance, the inequality n? < 2” fails for some 
small integers, but holds whenever n > 5. An inductive proof is possible, 
with the small twist that the base case now corresponds to n = 5. Claim 
VII (every even integer from 4 on is the sum of two primes) is another case 
in point: an inductive proof would have base case n = 4. (Whether an 
inductive proof can be found is another matter entirely.) 


adding k +1 to both sides of 
P(k) 


by algebra 


See the exercises. 
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Why induction works: Inductive proofs themselves may be hard or easy, 
but the underlying “domino theory” is simple and plausible. A rigorous 
proof that mathematical induction “works” depends on an axiom about the 
natural numbers: 


If S C N is such that (i) 1 € S, and (ii) n + 1 € S whenever 
né€ S,then S =N. 


We’ll take this basic fact, like others about the integers, as an axiom rather 
than something to be proved. 


Exercises 


1. Ineach part of this problem, either prove or disprove the given claim. If you 
prove the claim, indicate whether your proof is direct, indirect, by contra- 
diction, or something else. If you disprove the claim, use a counterexample. 
(It’s OK to assume known facts, such as that V2 is irrational.) In all cases, 
x and y are real numbers. 


(a) Ife €Q, then V2+2¢Q. 
(b) Ifa ¢Q, then /2+2¢Q. 


(c) If x + y is irrational, then at least one of x and y is irrational. 


(d) If pis a prime number, then 2? — 1 is prime. 
(e) For all real numbers x and y, | — y| < |x? — y?|. 
2. In each part following, either prove or disprove the converse of the given 
claim. If you prove the converse, indicate whether your proof is direct, 
indirect, by contradiction, or something else. If you disprove the converse, 


use a counterexample. (Assume known facts, such as that \/2 is irrational.) 
In all cases, x and y are real numbers. 


(a) Ife €Q then V2+2¢Q. 
(b) Ifx ¢ Q, then V2+2¢Q. 


(c) If x + y is irrational, then at least one of x and y is irrational. 


3. Consider the equation 1 + 34+5+---+(2n—1)=n?. 


(a) Check directly that the equation holds for n = 1 and for n = 10. 
(b) Prove by induction that the formula holds for all positive integers n. 


(c) Use (don’t reprove) the formula in Example 8 to give another proof 
(not involving induction) of the equation above. 


1.5. Types of Proof 


4. 


9. 


10. 


Let r £ 1 be a real number. Show by induction that for all positive inte- 


gers n, 
prt _ 4 


DR PP en 
(= 


. Show that the inequality 2” < n! holds for all integers n > 3. 


. Guess a formula (in terms of 7) for the sum 


1 1 1 1 


[2 2 a ay 


Prove your answer by induction. 


. Show that 


(1+2434+---+n)? =194294+3%+---4+n° 


holds for all positive integers n. (Hint: Use the formula for 1+ 2+---+n 
in Example 8.) 


. A version of Theorem 1.3 says that if n is a positive integer and ,/n is 


rational, then n is a perfect square. Prove this using the following outline, 
the idea of prime factorization, and the fact that a positive integer n is a 
perfect square if and only if every prime factor of n appears to an even 
power. See also Example 4, page 40. 


If ,/n = a/b, where a and b are positive integers, then squaring both sides 
gives nb? = a?. Now factor each side of this equation as a product of 
prime numbers. Because the right side is a square, each prime factor on the 
right side appears to an even power, and so the same must be true on the 
left. Each prime factor of b? appears to an even power, and so (do you see 
why?) each prime factor of n must also appear to an even power. 


(a) Show that 5” > n! for positive integers n < 12. 
(b) Show that 5” < n! for positive integers n > 12. 


(a) Guess a formula (in terms of n) for the sum 1-2+2-3+3-4+---+ 
n-(n +1). Prove your answer by induction. 


(b) Itis well known, and readily proved by induction, that 


” 1)(Qn+1 a 1 
es A n+1) and y= et ) 
j=l j=l 


Use these facts to give another proof of the result in (a). 
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11. Every calculus student knows that if f(a) = x” for any positive integer n, 
then f’(x) = nz”~1. Prove this by induction, assuming (i) if f(z) = 2, 
then f’(x) = 2; and (ii) the product rule. (We’Il define the derivative, state 
and prove the product rule, and firm up other ideas later in this book; here 
the point is to see induction in a familiar setting.) 


12. Another familiar calculus formula (see the preceding problem) says that 


: 1 n 
if g(x) = 7’ then g' (x) = ~ ee 


for every positive integer n. Prove this by induction, assuming both the 
n = 1 case and the product rule. 


13. Claim: In any group of n kittens, if one is orange, then all are orange. 


What is wrong with the following “proof” by induction? 


Proof: The claim is trivial if n = 1, so the base case holds. To illustrate the 
inductive step, assume the claim holds for n = 42. Suppose we’re given a 
group of 43 kittens, including at least one—say, Hans—that’s orange. Any 
other kitten—say Fritz—can be put in some 42-member group with Hans, 
and so Fritz must also be orange by the inductive hypothesis. Thus all 43 
kittens are orange, and the proof is done. 


1.6 Sets 103: Finite and Infinite Sets; Cardinality 
How “big” is a set? 


Finite sets: few surprises. “Small” sets offer few surprises. It seems clear, for 
instance, that S = {a, b}, with two elements, is “smaller” than T = {a, b, c}, with 
three. Similarly, Naz = {1,2,...,42} is obviously “smaller” than FIFTYSTATES 
= {Alabama, Alaska, ..., Wisconsin, Wyoming}, even though the two sets have 
completely different types of elements, while Nog and ENGLISHALPHABET = 
{a, b,c, ..., z } have the same “size.” 

“Measuring” sets by counting their elements works well for sets with finitely 
many elements. For such sets, moreover, our intuition is usually reliable. If, say, 
a set S has 427 elements and T ¢ S, then 7’ must be “strictly smaller,’ with 426 
elements or fewer.. 


Infinite sets: many surprises. Matters are very different for infinite sets. Con- 
sider, for instance, 


N = {1,2,3,4,... and E=({2,4,6,8...}. 
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In one way & seems obviously “smaller”—it omits all of the (infinitely many!) odd 
numbers in N. On the other hand, the mapping 


36, ..., nr 2n, 


is a one-to-one correspondence between N and EF, so maybe the two sets have the 
“same size.” Similarly, the interval (0, 1) is “shorter” than the interval (0,2), but 
the same mapping, x +> 22, is a one-to-one correspondence. An important aim 
of this section is to develop useful ways of “measuring” infinite sets. 


Cardinality: The Idea 


So how should we gauge “size” for infinite sets? The answer for present purposes 
involves cardinality. We need some clear definitions, and an excuse to drop all 
those lawyerly quotation marks: 


Definition 1.13. Two nonempty sets A and B have the same cardinality if there 
is a function f : A — B that is both one-to-one and onto. In this case we write 
A~B. 


Definition 1.14. A set A is finite if either A = @ or A ~ {1,2,...,n} for some 
positive integer n. Otherwise, A is infinite. 


Informally speaking, two sets have the same cardinality if there is a one-to- 
one correspondence between them. We saw this above for N and EF and also for 
(0, 1) and (0, 2); thus, N ~ E and (0,1) ~ (0, 2). 


EXAMPLE 1. Show that (0, 1), (—2, 5), and (0, oo) all have the same cardinality. 


SOLUTION. It is enough to find one-to-one and onto functions f : (0,1) — 
(—2,5) and g : (0,00) — (0,1). Figure 1.1 hints at two of the many possibilities. 

The function f shown is the linear function whose graph passes through 
(0, —2) and (1,5); its formula turns out to be f(x) = 7x — 2. The function g 
shown has formula g(a”) = x/(1 +2). Graphs of f and g illustrate their one-to- 
one and onto properties; honest proofs require the formulas. 

To prove, for instance, that g is onto, we have to find, for each b € (0,1), 
some a € (0,00) with f(a) = a/(1 + a) = b. A little symbolic work reveals the 
recipe: 


E =b = a=b+ab = a-ab=b => a=. 
l+a 1—b 


Thus, for any given b € (0,1), the positive number a = b/(1 — b) satisfies 
g(a) = b. © 


Check this easy calculation. 
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We can describe this 
correspondence a little more 
formally; see the exercises. 


As indeed we do in this book. 
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O05 - 


10 20 


(a) Graph of f : (0,1) — (—2,5) (b) Graph of g : (0,00) — (0, 1) 


Figure 1.1. Showing that sets have the same cardinality. 


EXAMPLE 2. Show that N and Z have the same cardinality. 


SOLUTION. The following table suggests one possible one-to-one correspon- 
dence: 


The key property should be clear: every z € Z appears once and only once in the 
second row. The integer z = —42, for instance, corresponds to n = 85. © 


We’ ve just seen that Z can be “paired up with” or “counted off against’””» N— 
even though Z is, in another sense, a bigger set. The following definition captures 
this possibility: 


Definition 1.15. A nonempty set A is countable if either A is finite or A has the 
same cardinality as N. In the latter case, A is countably infinite. 


We’ve already seen several examples, including Z, of countably infinite sets. 
We’ll show soon, too, that Q is countable. (This is probably surprising—after all, 
between any two rationals lie infinitely many others, so Q might seem “larger” 
than N.) We might wonder, then, whether any set of numbers, even R itself, is 
uncountable, i.e., too big to be in one-to-one correspondence with N. The answer, 
as we’ll see, turns out to be yes. 


Working with Countable Sets 


Finite sets. Finite sets have commonsense properties as far as cardinality is con- 
cerned, at least if we assume some basic properties of the integers. Following are 
some properties we will accept, without formal proofs, as intuitively reasonable. 
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° The pigeonhole principle: If a (finite!) hotel has fewer rooms than guests, 
then at least one room must house at least two guests. This simple but useful 
idea is known as the pigeonhole principle (trade rooms for nest boxes and 
guests for pigeons). Here is a more formal statement: 


Let f : A — B be any function, where A and B are finite sets, 
and B has fewer members than A. Then f is not one-to-one. 


° Subsets are finite, too: If Aisa finite set and B C A, then B is finite, too. 


© New finite sets from old: If A and B are finite sets, then A U B, AN B, 
A\ B,and A x B are all finite. 


© Listing and ordering: If A is any finite set, say with 42 elements, then we 
can list the elements: A = {a1, a2, @3,...,@42}. If A happens to be a finite 
set of real numbers, we can order our list from smallest to largest: 


ay < a2 < a3 < +++ << 49 < G41 < G42. 


Biggest and smallest members. The last property above has a useful form that 
deserves special mention: 


Fact 1.16. Every finite set of real numbers contains a maximum and a minimum 
element. 


The words “finite” and “contains” both matter. Some infinite sets, such as Z, 
are unbounded, and therefore obviously lack maximum and minimum elements. 
The case of a bounded infinite set, such as the open interval (1,3), is a little 
subtler. The problem is that 3 and 1—the obvious candidates for biggest and 
smallest—are not members of (1,3). 


Countably infinite sets. The set N is the “model” countably infinite set, but many 
other sets turn out to have the same cardinality as N. The following proposition 
says, in effect, that N is the “smallest” infinite set. 


Proposition 1.17. Let S be countably infinite, and let T C S' be any nonempty 
subset. Then T is either finite or countably infinite. 


Notice the possible surprise: Every infinite subset of N—the odd numbers, the 
primes, the powers of 2, the powers of 123456789—is in the sense of cardinality 
just as “big” as N itself. 

The idea of the proof is to /ist all the members of T’: 


t1, to, t3,..., toga, -.-. 


Such a list defines a nice one-to-one correspondence: 1 +> 11, 2 +> te, ..., 
234 +> to34, and so on. The details follow. 
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We'll revisit some of these facts 
in the exercises. 


The closed interval [1,3] does 
contain maximum and minimum 
elements. 
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It’s discussed in Section 1.1. 


The proof is a little easier if 
either or both are finite. 


The matrix extends infinitely 
upward and to the right, like the 
first quadrant in the xy-plane. 


Find these entries in the big 
matrix. 
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Proof (sketch): By hypothesis, S is in one-to-one correspondence with N, so we 
may as well assume right away that S = N. and that T CN. If T is finite we’re 
done already, so we assume T is infinite. 

To finish the proof we need a bijective function f : N — T. To start, let 1 
be the smallest element of T' (the well-ordering principle guarantees that there 
is a smallest element), and set f(1) = t,. Now let tg be the smallest element 
of T \ {t1}, and set f(2) = to. Continuing, we set f(3) = t3, where ts is the 
smallest element of T \ {t1,t2}, and so on. This process continues indefinitely 
because T’ is infinite, and so does indeed construct a function f : N — T, as 
desired. The method of construction assures that the ¢; are all different from one 
another, and so f is one-to-one. The function f is also onto. If, say, t = 42 € T, 
then there are at most 41 smaller members of T, so f(k) = 42 must hold for some 
k; < 42. A similar argument works for every t € T’, so f is indeed onto, and the 
proof is done. 


New countable sets from old. Proposition 1.17 gives one way of creating count- 
able sets: start with a given countable set and take subsets. Propositions 1.18 and 
1.19 describe some other operations that leave countability intact. You may be 
surprised .... 


Proposition 1.18. Let A and B be countable sets. The Cartesian product 
Ax B= {(a,b)|aeAandbe B} 
is countable. 
Proof: We’\l take both A and B to be countably infinite. In this case, we have 
A= {a1, a2, a3,...} and B= {by, be, bs,...}, 


and we can list all members of A x B in a “doubly-infinite matrix”: 


(a1,61) (a2,b1) (@3,b1) (a4, 61) 


Our last trick is to “count off” all the entries in this gigantic array—without miss- 
ing any. One way to do this is to start at lower left and proceed along “northwest- 
pointing” diagonals: 


(a1, 61), (a2, b1), (a1, b2), (a3, b1), (az, b2), (a1, b3), (a4, b1), wees 


Do you see the pattern? What matters is that every point (a;,5;) appears once, 
but only once, in our list. These facts imply that the function f : N— Ax B 
given by n +> nth entry in the list is the desired bijection. 
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Proposition 1.19. Let A,, Ao, Az, ..., An, ...be a countable collection of sets, 
all of them countable. Then the union 


Ay U AU Ag U- ++ = [J Ai 
i=1 


is also countable. 


Proof (sketch): Again we assume that each A; is countably infinite, and list its 
elements: A; = {@;,1, @:,2, @:,3, ... }. As in the preceding proof, we can list all 
elements of all the A; in a big “matrix”: 


43,1 43,2 43,3 43,4 
42,1 42,2 42,3 42,4 


41,1 41,2 41,3 41,4 


Just as before, this “matrix” has countably many entries, so we’re almost done. 
But a minor subtlety needs attention: the A; might have some elements in com- 
mon, so some elements of the union might appear more than once in the “matrix.” 
Luckily, this turns out not to matter, because the distinct elements of the union cor- 
respond to some subset of the matrix entries, and we showed in Proposition 1.17 
that such subsets are countable. 


Corollary 1.20. The set Q of rational numbers is countable. 


Proof: It is enough, says Proposition 1.19, to write Q as a countable union of 


countable sets. For 7 = 1,2,3,..., let 
{3 1 -1 2 -2 \ 
Go) So ee Be Pe 
a a a re) 
be the set of fractions a/i, where a € Z. Now each A; is clearly countable—it is A given rational, say 3/7, 
in one-to-one correspondence with Z—and Q is the union of all the Aj. appears not just in A7 but also 


in Aj4, Az1, and so on. 


Uncountable Sets 


The preceding propositions imply that many apparently large sets, such as Q, are 
in fact no larger in cardinality than N. Might R itself be countable? The answer 
is hardly obvious, but it turns out to be no, as the German mathematician Georg 
Cantor showed around 1873. 


Theorem 1.21. R is uncountable. 
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We found explicit bijective 
functions between these 
intervals. 


Pun intended. 
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We’ll give a version of Cantor’s ingenious proof below, but first let’s consider 
some striking implications of the theorem: 


Corollary 1.22. The interval (0, 1) is uncountable. Every nonempty open interval 
(a, b) is uncountable. The irrational numbers are uncountable. 


All parts of the corollary say that the sets mentioned are, in cardinality, much 
larger than the comparatively “sparse” set Q. Throw a dart at random at the real 
line, and you’ll almost certainly hit an irrational. 


Proofs of the corollaries. In Example 1 we showed that the intervals (0,1), 
(—2,5), and (0, 00) all have the same cardinality. Similar methods show that all 
nonempty open intervals, including (—oo, co) (aka R), have the same cardinality. 
If the set P of irrationals were countable, then R = P U Q would be the union of 
two countable sets, and hence countable itself. 


Cantor's “diagonal” proof. Cantor uses the idea of infinite decimal expansion 

to show that the interval (0,1) is uncountable. The proof is by contradiction. If 
(0,1) were countable, we could list all of its elements in an infinite sequence: 
Y1, %2, ©3,.... Now each x; has an infinite decimal expansion, of the form 
x4 = 0.di1 diz diz dia dis ..., where the d;; are decimal digits, ranging from 0 to 
9. Thus, we can write 


1 = 0.d11 dy2 di3 dia dis... 
x2 = 0.d21 dz2 doz dz4 das ... 
x3 = 0.d31 d32 d33 d3q d35 ... 
x4 = 0.d4y daz daz daa das ... 
@5 = 0.d51 d52 d53 d54 d55 ... 


Here comes the clever part. Cantor uses the diagonal entries 


dy,1, d2,2, d3,3, daa, d5,5, .-- 


in this array to produce a new number xo = 0.e €2 €3, ..., which lies in (0, 1) 
but all of whose digits differ from the corresponding diagonal entry: 


ep Adi, e2A#do2, €3 #d33, e4 F daa, 


There are ten choices for each digit, so there are plenty of ways to choose the e;, 
and therefore countless possible numbers 79. What matters is that x differs from 
x1 in the first digit, from x2 in the second digit, from «3 in the third digit, and 
so on. Thus xp is nowhere among the original x;, which contradicts our original 
assumption. 


1.6. Sets 103: Finite and Infinite Sets; Cardinality 


Exercises 


1. Consider the functions f : (0,1) — (—2,5) and g : (0,00) — (0,1) 
defined in Example 1. (It is shown there that g is onto.) 


(a) Show that f and g are both one-to-one on their domains. 
(b) Show that f is onto. 


(c) Show that f o g is one-to-one and onto. What does this mean about 
cardinality? 


2. Let (a,b) and (c,d) be any two bounded intervals. Find a linear function 
f : (a,b) — (c,d) that is one-to-one and onto. (Give an explicit formula 
for f in terms of a, b, c, and d.) 


3. Let A be a set with 42 elements and B a set with 43 elements, and let 
f:A-— Bandg: B — Abe functions. Can f be one-to-one? Onto? 
What about g? Give examples of what can happen, and explain what can’t. 
How is the pigeonhole principle involved? 


4. Let S be a finite set. Use the pigeonhole principle to show that if f: S— S 
is one-to-one, then f is also onto. 


5. It can be shown that a set is S is infinite if and only if there exists a function 
f : S — S that is one-to-one but not onto. (The “if” part is the claim in the 
preceding problem.) 


(a) Find a function f : N — N that is one-to-one but not onto. (There are 
many possibilities.) 


(b) Find a function g : R — R that is one-to-one but not onto. (There are 
many possibilities.) 


(c) Find a function h : R > R \ {1} that is one-to-one and onto. 


6. Suppose that A ~ Band B ~ C. Use Definition 1.13 to show that A ~ C. 
7. Suppose A is infinite, B is finite, and B C A. Show that A \ B is infinite. 
8. Let f : N — Z be the function described by the table in Example 2. 


(a) Find a symbolic formula for f(n), perhaps using cases. 


(b) Let g : Z — N be the other function described by the table in Exam- 
ple 2 (read the table from bottom to top). Find a symbolic formula for 
g(z), perhaps using cases. 

(c) Show that both go f : N — Nand fog: Z — Z are identity 
functions. 
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9. 
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We proved Proposition 1.19 (Q is countable) by writing Q as the union of 
countably many countably infinite sets. We could, instead, write Q as a 
nested union of countably many finite sets. 


(a) For €N, let Q; be the set of rationals p/q (written in reduced form), 
for which p? + q? < i. How many members does Q3 have? What 
about Qi9? Can Q; = Qji41 for some 2? (Hints: Members of Q; 
correspond to some, but not all, of the points inside the circle p? + q? 
in the pq-plane. For Q3 and Qj, just count these points.) 


(b) Find an upper bound in terms of 2 for the number of elements in Q;. 
(c) Show that Qi C Q2 C Q3 C... and that U2, Qi = Q. 


Show that [0, 1] and (0, 1) have the same cardinality by finding a function 
f : [0,1] — (0, 1) that is one-to-one and onto. 


Hints: The claim seems reasonable, but finding a good function f is tricky. 
Here is one possibility: Set f(0) = 1/2, f(1) = 1/3, f(1/2) = 1/4, 
f(1/3) = 1/5, f(1/4) = 1/6, f(1/5) = 1/7, etc. For all other x, set 
f(x) = x. Think about it, draw a graph, etc.; show that this function does 
the job. (We’ll show later, by the way, that no continuous function f can 
have the desired properties.) 


. Show that [0, oo) and (0, 00) have the same cardinality. 


. Use results from these problems to explain why all intervals in R have the 


same cardinality. 


. A “book” is a finite string of characters from some finite “alphabet,” such 


as the 128-character ASCII system. Is the set B of all possible “books” 
finite, countably infinite, or uncountable? 


. A polynomial with integer coefficients is an expression of the form p(a) = 


ao + aya + agu? +--+ anx™, where nis a nonnegative integer and all 
the a; are integers. The set of all such polynomials is denoted Z[x]. A 
real number a is called algebraic if p(a) = 0 for some p in Z[z]. If, say, 
p(x) = x? — 2, then p(V/2) = 0, so V2 is algebraic. 


(a) Show that Z[:] is countable. 
(b) Show that every rational number is algebraic. 


(c) Show that the set of algebraic numbers is countable. 


(d) A real number that is not algebraic is called transcendental. Show that 
the set of transcendental numbers is uncountable. (Note: Showing that 
a particular number is transcendental is usually difficult. But “nearly 
every” number is transcendental.) 
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1.7 Numbers 102: Absolute Values 


Recall the familiar absolute value function: 


x if x > 0; 
|z| = 
—x ifx<0. 


In beginning calculus this function is a favorite example of mildly bad behavior— 
it’s continuous but has no derivative at x = 0. We’ll use the same example later, 
too, but the absolute value also plays a much more basic role in real analysis: it 
measures the size, or magnitude, of a quantity. Such questions—how big? how 
close?—are crucial in real analysis; so, therefore, are equalities and inequalities 
with absolute values. 

The absolute value has a nice geometric interpretation: 


If x and a are real numbers on a number line, then |x — a| is the 
distance between x and a. 


This simple idea is surprisingly helpful in making sense of expressions that in- 
volve absolute values. 


EXAMPLE 1. Which real numbers z satisfy (i) |x — 3| < |x + 2|? What about 
(ii) [Ba — 7] <5? 


SOLUTION. Both inequalities are slightly awkward algebraically because of the 
need to handle positive and negative cases separately. Distance interpretations can 
help—a lot. Inequality (i), recast as |a — 3] < | — (—2)|, now says simply that x 
is closer to 3 than to —2. This means, in turn, that « lies left of 1/2, the midpoint 
of (—2, 3); equivalently, « < 1/2. 

Inequality (ii) can also be interpreted in distance language, after a little rear- 
rangement: 


7 5 
32-7 <5 = 3 —= =. 
[3a —7| < x i)<3 


7 
—= 5S 
x ‘|< 


The last version makes the meaning clear: x lies within distance 5/3 from 7/3. 
Equivalently, x € (2/3,4). ©) 


Absolute Truths 


We collect some key properties of absolute values in two theorems. The first is 
easy; proofs are omitted or left as exercises. 


Make a sketch. 


Is all the algebra OK? 
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We explain the name below. 
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Theorem 1.23. Let x and y denote arbitrary real numbers; assume k > 0. 
(a) —|z| <a < |e. 
(b) |zl=|-a];  |e—yl=ly—al. 

(c) |ja|<k == -k<a<k. 

(d) |jaj >k == a«>k or a<-k. 

(e) |x yl = lel ly 

The next theorem is important enough to have its own name. 

Theorem 1.24 (Triangle inequality). Let x and y be any real numbers. Then 

jz + yl < |e] + [yI. 


The reverse triangle inequality says 


jz — y| > ||2] — |yl |. 


On proofs. The triangle inequality follows from several bits of Theorem 1.23. 
From (a), we get 


—|t)<a@<|xe| and —|yl<y<l|yl. 
Adding these inequalities gives 
—(la| + ly) =—-lel-lylsa@+y<(al+lyl), 


which is equivalent, by part (c) of Theorem 1.23, to |a + y| < |a|+|y|, as desired. 
The reverse triangle inequality can be derived from the standard triangle in- 
equality; see the exercises. 


Variations on a theme. The triangle inequalities are often used in forms slightly 
different from the “vanilla” versions in the theorem. Following are some exam- 
ples; observe occasional uses of Theorem 1.23: 


@  |e—yl=la+(—y)I < lel + lyls 
(i) [x—yl=|z-ata—y|<|e—al+la—y|=|e—al + |y—al; 
(iii) |x] — |y| < Je —y| < [2] + IyI. 
Note especially what (ii) says about distance: the trip from x to y is no longer 


than the sum of the distances from x to a and from a to y. This should sound 
reasonable—stopping at a enroute shouldn’t shorten the trip from x to y—and it 
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helps explain the allusion to triangles. Note also (iii), which traps |x — y| between 
upper and lower bounds. 

In fact, triangle-type inequalities hold in quite general mathematical settings, 
some beyond the scope of this book. The basic triangle inequality |a+y| < 
|z|+|y| holds for complex numbers « and y, for instance, if |a| denotes the length 
of x considered as a vector in the plane. Generalizing in another direction, related 
facts like 

[ty + ae +++++ a100| < |xi| + |ze| +++» + [2100] 


[ff teva < fifo) an 


also hold, and are sometimes called triangle inequalities. 


and 


EXAMPLE 2. Suppose we know that |~ — 7| < 0.01, |y — 7| < 0.02, and 
|z — 7| > 0.03. What can be said about |x — y|, |x — z|, |xy|, and |a/y|? 


SOLUTION. Triangle inequalities help. To estimate |x — y| we have 
ja —y|=|a-7+7-y| < |v—-—7|4+ |ly—7| < 0.01 + 0.02 = 0.03. 


For |x — z| there is no upper bound (why?), but the reverse triangle inequality 
gives a lower bound: 


|e — z| = |(@ — 7) — («-7)| => |z —7| — |x —7| > 0.03 — 0.01 = 0.02. 


To estimate |xy| and |x/y| we notice first that 6.99 < |x| < 7.01 and 6.98 < 
ly| < 7.02. Thus, 


7.01 
|zy| = |z||y| < 6.99 -6.98 = 48.79 and | sae <—— 1.004. 9 
y ly| — 6.98 


Exercises 


1. Find all real values of x that satisfy the given inequality; express answers 
in interval notation. 
(a) |2a+7| < 11 
(b) |2a+7| > 11 
(c) [207 +7| < 11 
(d) ju —3| < |a+1| 
(e) |x* — 4] < 0.07 
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These bounds follow from the 
triangle inequality; common 
sense works, too. 
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2. Suppose that | — 3| < 0.07 and |y — 5| < 0.04. 


(a) Show that |5a — 15| < 0.35. 

(b) Show that jaz + y — 8| < 0.11. 

(c) Show that 14.5328 < ry < 15.4728. 
(d) Show that |y — 4] > 0.96. 


io) 


. This problem is about the reverse triangle inequality (RTI). 


(a) Use the ordinary triangle inequality to prove the RTT. 


(b) Each side of the RTI can be interpreted as the distance between two 
numbers. What does the RTI say in this language? 


(c) Under what conditions on z and y is the RTI actually an equation? 


4. The triangle inequality (TI) for three summands follows from the ordi- 
nary TI: 


lay +22 + X3| = |(21 + Lo) + 23] < |x + xa|+|x3| < |z1|+|z2|+|z3]. 


Show by induction that the TI holds for n summands, where n > 2. 


Nn 


. The triangle inequality |@+ gy] < |z| + |y| also holds for vectors # and ¥ 
in the plane, where || denotes the /ength of Z. Draw a picture to illustrate 
this fact; make note of the triangle. When does equality hold? 


6. This problem explores a connection between the absolute value and the 
maximum or minimum of two quantities. 


(a) Show that if x and y are real numbers, then 


c+yt|e-yl 


max{xr,y} = 5 


(b) Find a similar formula for the minimum of x and y. 


(c) Let f : 1 — Rand g: I — R be functions defined on an interval 
I. Find a formula involving the absolute value for the new function 
h: I — R defined by h(x) = max{ f(x), g(z)}. 


(d) Let f(x) = sin and g(x) = e”, and let h(a) be defined as in the 
preceding part. Use technology to plot f, g, and h together in an 
interval that shows clearly what’s happening. 


7. Let € be a positive number, and suppose |x — 7| < € and |y — 7| < «. Show 
that |a — y| < 2e. 
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8. (a) Suppose that |a — 1| < 0.5. Show that x > 0.5. 


(b) Suppose that c 4 0 and |x — cl < tet Show that |x| > a 


9. Yet another triangle-type inequality has the form 


Lf fora 


Discuss, perhaps with a picture, why this is plausible. How are Riemann 
sums involved? (No formal proof is possible until we define the integral!) 


< i “| f(@)| dee. 


1.8 Bounds 


Theorems and proofs in real analysis often turn on boundedness. We’ll prove 
later, for instance, that a continuous function defined on a closed and bounded 
interval I = [a,b] is itself bounded on I. To make sense of such a claim re- 
quires clear meanings for all its words—which include two somewhat different 
instances of the b-word. In this section we start to unpack the idea and language 
of boundedness in several settings. 


Definition 1.25. Let S be a nonempty set of real numbers. 


© Sis bounded above if there exists a number / with s < M forall s € S; 
here M is an upper bound for S. 


° Sis bounded below if there exists a number m with s > m for all s € S; 
in this case m is a lower bound for S. 


© Sis bounded if it is bounded both above and below. 


It is clear, for instance, that all three of the sets {1,2,3}, [1,3], and (1,3) are 
bounded above by 3 and below by 1, while the interval (1, 00) has the same lower 
bound, but no upper bound. 


Here are some basic notes on the idea and language of boundedness: 


© Many choices: Upper and lower bounds are far from unique. For {1, 2,3}, 
[1,3], and (1,3), for instance, —42 and 42 also work as lower and upper 
bounds. These may seem unlikely choices, but in practice we sometimes 
care more about the existence (or absence) of bounds than about particular 
numerical values. 


We'll also need to define 
“continuous,” of course. 
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And similarly for sets © Unboundedness: A set is unbounded if it is not bounded. The definition is 
unbounded above or far from surprising, but deciding whether a set is bounded or unbounded 
unbounded Belo can be challenging. What do you think about 


1 A> St oh ae 
ag es re ee aera ca OR Cacti 
ts ig per Bern gs Be a \ 


and 


1 1 1 1 1 1 
feitpiesehitstataeef, 


for example? 


Boundedness and absolute values: Absolute values—which measure mag- 
nitude—have a natural connection to boundedness. Indeed: 


Fact 1.26. Let S C R be a nonempty set. S is bounded if and only if there 
exists M > 0 such that |s| < M forall s € S. 


We outline the (easy) proof in the exercises. 


EXAMPLE 1. Let S and T be nonempty sets of real numbers. Prove some basic 
properties of bounded sets: 

(a) If S Cc T and T is bounded, then S' is bounded, too. 

(b) If S is finite, then S' is bounded. 


(c) Let |.S| = {|s| | s € S}. Then S is bounded if and only if |.S'| is bounded. 


(d) LetS+T={s+t|seS,teThand ST ={st|seS,teT}. IS 
and T are bounded, then so are S + T and ST. 


SOLUTION. These facts follow directly from the definition of boundedness. 

Claims (a) and (c) are left to the exercises; claim (b) follows from the fact that 

every finite set of numbers has a largest and a smallest member. For (d), note first 

that, by assumption, there are positive numbers Mg and Mr with |s| < Msg and 
See the triangle inequality? \t] < Mr forall s € S andt € T. Hence we have 


Is +t] <|s|+|t]|<Ms+Mrp and |st| =|s||t] < MsMr 


for all s and ¢ in question, and the claims about boundedness follow. ©) 
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Getting Edgy: sup, lub, inf, and glib 


The interval J = (—3, 42], like every bounded set, has infinitely many upper and 
lower bounds. But there is something obviously special about —3 and 42: the 
former is the greatest lower bound and the latter the least upper bound for I. The 
following definition formalizes these properties—and introduces some impressive 
Latin synonyms. 


Definition 1.27 (Supremum and infimum). Let .S C R be anonempty set. A num- 
ber a is the infimum of S', and we write a = inf(S) (or a = glb(S$)) if 


(i) ais a lower bound for S$; and 
(ii) if a’ is any lower bound for S, then a > a’. 


A number (3 is the supremum of S, and we write 3 = sup(S) (or a = lub(S)) if 
similar inequalities hold for upper bounds. 


This may all seem, for now, like a lot of fuss over not much. Indeed, there is little 
interesting to be said about sups and infs of simple sets, like bounded intervals. 
We will see soon, however, that the existence of real sups and infs for more general 
bounded sets—a property called completeness—is essential in the theory of real 
analysis. Here, for the moment, are some simpler notes on the definition. 


® Do they exist? If S is unbounded above (or below), then, obviously, S has 
no supremum (or infimum). Every bounded set of real numbers, by con- 
trast, has a real supremum (3 and a real infimum a. We discuss this subtle 
property and its implications carefully in the next section. 


In or out? The interval J = (—3,42] contains its supremum but not its 
infimum; including or excluding intervals’ endpoints shows that all such 
combinations are possible. If a set S contains its supremum ( (or infimum 
a), then (3 is the maximum (or minimum) element of S. 


Only one? We said “the” in the preceding definition, and this is justified: a 
set S' has at most one supremum and one infimum. The proof is easy. If (31 
and (32 are both suprema for S, then we’d have both 3; < (2 and G2 < (4, 
and so G, = (2. 


Elbow room. Every member of J = (0,1) is positive, but J pushes right up 
against 0 in that members of J can be found within any given distance of 0. By 
contrast, I leaves 1.003 a little “elbow room”: no member of I is within 0.003 of 
1.003. This distinction, which comes up often in real analysis, can be described 
efficiently in terms of boundedness. 


Definition 1.28. If a is anumber and S C R is anonempty set, then S is bounded 
away from a if, for some 6 > 0, |s — a| > 6 forall s € S. 
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Any positive 6 smaller than d; 
works, too. 


sin x increases on this interval. 
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For I = (0,1) and a = 1.003, for example, we can use 5 = 0.003 (or any smaller 
value of 5). Just as clearly, no positive 5 works for J = (0,1) and a = 0. 


EXAMPLE 2. The interval J = (2,3) is bounded away from 7 because |m— x| > 
0.14 for all  € (2,3). By contrast, 3 ¢ I, but I is not bounded away from 3, 
since J has members within any small distance 6 from 3. The set N is bounded 
away from 7, but Q is not, since for any 6 > 0, no matter how small, there are 
rational numbers within 6 of 7. © 


EXAMPLE 3. Let S = {81,89,...,8n} be a finite set of real numbers, with 
a ¢ S. Show that S is bounded away from a. 


SOLUTION. The idea is that, since S is finite, one of the s; must be closest 
to a. More precisely, let d; = |s; — a| be the distance from s; to a. Then 
D = {di,d2,...,dn} is a finite set of positive numbers, and so has a minimum 
member, say d,. (Every finite set of numbers has largest and smallest elements; 
see Fact 1.16, page 49.) Now it is clear that 6 = d,/2 works in Definition 1.28. 
% 


Functions and Bounds 


Boundedness for functions is much like that for sets, but the jargon is slightly 
different: 


Definition 1.29. Let f be a real-valued function and A a subset of the domain of 
f. If f(a) < M forall a € A, then f is bounded above by M on A. If f(a) >m 
for all a € A, then f is bounded below by m on A. If f is bounded above and 
below on A, then f is bounded on A. 


EXAMPLE 4. Let 
f(x) =sinz, g(z)=2°-52?+7x+13, and h(x) =1/2. 


On which sets A are f, g, and h bounded or unbounded? 


SOLUTION. The function f is least interesting. Because —1 < sin(x) < 1 for 
all real x, f is bounded (above by 1 and below by —1) on every set 
A CR. Sharper bounds can be found on some smaller sets, of course. If, say, 
A = (0.36, 1.27] and a € A, then 0.353 © sin 0.36 < sina < sin 1.27 = 0.955. 
We know (informally for now) from elementary calculus that g(a)—like every 
odd-degree polynomials—“blows up” for large (positive or negative) values of x. 
Thus, g is unbounded on unbounded sets like [0, 00), (—0oo, 42), and R itself. On 
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the other hand, g is bounded on every bounded set A. If, say, A is the interval 
[—K, K] and a € A, then we have 
|g(a)| = |a® — 5a? + 7a +: 13] < Ja®| +5 ]a?| +7 Jal +13 
< K345K?4+7K +13; 
the last quantity is an upper bound. (With more work we might find a smaller 
upper bound.) 
The function h behaves somewhat differently: although bounded on some 


infinite intervals, such as [0.00017, 00) and (—co, —0.24), h is unbounded on 
some finite intervals, like (0, 1) and (—0.24, 0). ©) 


Exercises 
1. Consider the sets EW of all non-hyphenated English words and WITP of 
all words in this problem. Let f be the function with rule 
f (word) = number of letters in word. 
(a) Is f bounded above and/or below on EW? Can you give good upper 
and lower bounds? Explain. 
(b) Find upper and lower bounds for f on the set WITP. 


2. Consider the real-valued function g whose domain is the set GCES of gram- 
matically correct English sentences, and has rule 


g(sentence) = number of words in sentence. 


Is g bounded on GCES? Can you give upper and lower bounds? What 
bounds apply g on the subset SITP of sentences in this problem? 


3. We suggested in Example 4, page 62, that every bounded set S C R is 
contained in some interval of the form [—K, K’]. Prove this. 


4. Consider the interval J = [—10, 10]. Find good upper and lower bounds for 
each of the following functions defined on J. Use the triangle inequality or 
ideas from elementary calculus. 


1 
(a) f(z) = er, 


(b) g(x) = 3a? + 2a — 7. 
(c) h(x) = sin? x + cos? x + x. 


5. Do the preceding problem, but replace the interval J with the interval J = 
[(—K, K], where K > 0. (Answers may depend on K, of course.) 


Thanks, triangle inequality. 
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. Let f(z) = Ax + B, where A and B are any real constants, and let I be 


the interval [—K, K], where K > 0. Find sharp (i.e., best possible) upper 
and lower bounds for f on J. (Hint: Handle the cases A > 0, A = 0, and 
A < 0 separately.) 


. Prove parts (a) and (c) of Example 1, page 60. 


. Is each of the following sets bounded? If so, give upper and lower bounds. 


If not, why not? (It’s OK to use ideas from elementary calculus.) 


(a) {uit tient 
1 
(b) {ais see} 


In1l In2 In3 In4 
(c) § —, ——, —,— .... 


| ie aes mee! 


Dds 923. 9% 
(d) \neg Ee} 
(e) {tan(x) | « € R} 
(f) {c €R|2* —5a* + 7x — 1234 = 0} 


. Show that if S is bounded and f(x) = ax + b, then f is bounded on S. 


. We remarked in this section that a set S of real numbers is bounded if and 


only if there is some M > 0 such that |s| < M forall s € S. 


(a) Suppose S has upper and lower bounds a and b, respectively. Find an 
M that satisfies the condition above. 


(b) Suppose M satisfies the condition above. Find upper and lower bounds 
for S. 


(c) Prove the statement at the beginning of this exercise. 


. Let S C Rbe a nonempty set with 0 ¢ S. 


(a) Can S' be both (1) unbounded; and (ii) bounded away from 0? Either 
give an example or explain why none is possible. 


(b) Show that S' is bounded away from 0 if and only if the set T = {1/s | 
s € S} is bounded. 


(c) Show that S is bounded away from 0 if and only if there is an open 
interval J = (a,b) withO € Jand SOI =9. 


In each of the following cases either give an example or say why none can 
exist. 
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(a) A set of positive numbers that is bounded away from one but not from 
Zero. 


(b) A number a such that Q is bounded away from a. 


(c) A set S that is bounded away from a = 42 but not bounded away 
from any other integer. 


13. This problem is about the intervals J = [0,1] and J = (2,3). 


(a) Show that if a ¢ I, then I is bounded away from a. 
(b) Does J have the property described in (a)? Why or why not? 
(c) Does N have the property described in (a)? Explain. 


14. Show that if S C R isa finite set, then S contains both sup(S) and inf(S). 


15. Let S C R be bounded, with 6 = sup(S). Show that S is not bounded 
away from (3. 


1.9 Numbers 103: Completeness 


There are many, many rational numbers. Indeed, between any two particular 
rationals lie infinitely many others. Between zero and one, for instance, we find 


111 1 1 

2’ 3? 4’ "” 507’ 50877" 
Between any two of these, say 1/507 and 1/508, lie infinitely many others; and 
between any two of those .... Who could ask for more? 

We could, and we do. There are just not enough rational numbers out there 
to permit the sort of analysis—or even algebra—that we want to do. We have 
seen, for instance, that there is no rational, exact solution to the simple-looking 
equation x? = 2, even though rational numbers like 1.4142135 (too small) and 
1.4142136 (too big) come very, very close. 

The real numbers have no such gaps. Somewhere between all those too- 
small rational numbers x (with x? < 2) and the too-big rational numbers y (with 
y” > 2) is a real number r that deserves the name \/2 because r? = 2 exactly. 
We will take this “gap-free” property, known technically as completeness, as an 
axiom: a defining property of the real numbers rather than a claim that requires 
proof. 


The Completeness Axiom for R. Every nonempty set of real numbers 
that is bounded above has a supremum. In symbols: If S Cc R, 
S # ), and S is bounded above, then there is a real number (3 such 
that 6 = sup(S). 
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See the exercises. 


Rational numbers are also real. 


The question boils down to 
whether 152399024 is a perfect 
square. It isn’t... quite. 
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Like other succinct mathematical statements, this one can use some unpacking: 


* What about infs? For brevity, the completeness axiom mentions only sups. 
But the story is similar for infs: Every nonempty set that is bounded below 
has an infimum. The inf version, moreover, follows easily from the sup 
version. 


° The real advantage: The completeness axiom guarantees that every bounded 
set of rationals has a supremum—which may or may not be a rational num- 
ber. By contrast, the supremum of a bounded set of reals must be real. In 
this sense R is complete, while Q is incomplete. 


° Inor out? We know already that the sup and the inf of a given set may 
or may not lie within the set. It is pretty obvious, for instance, that the 
half-closed interval (1, 2] contains its supremum but not its infimum. For 
more complicated sets, things may be less clear. Consider, for example, 
the set S = {r EQ\r?< 152399024}. A little thought shows that S' 
is bounded, so the completeness axiom guarantees that a sup and an inf 
exist. We might also guess (correctly!) that sup(S') = /152399024 and 
inf(S) = —vV 152399024, but do these numbers lie inside or outside $? 
This is far from clear at a glance—and the completeness axiom is no help 
at all. To decide, we’d need to discover whether 152399024 is rational, 
and this takes a little effort. 


Using Completeness: Order Properties of 


R 


The following theorem collects some useful properties of the real numbers as an 
ordered set. Most are familiar; the surprise, if any, is the role that completeness 


plays in their proofs. 


Theorem 1.30 (Order properties of numbers). 


¢ N is unbounded: Given any real number M, there exists a positive integer 


nwithn > M. 


* Squeezing in: For every positive number €, no matter how small, there ex- 


ists a positive integer n such that 0 < + <e 


© The Archimedean principle: Given real numbers a and b withO0 <a < b, 


there exists a positive integer n with na > b. 


More on the Archimedean principle. The Archimedean principle can be thought 
of geometrically. The challenging case occurs when a is small and b is large, as 


suggested in Figure 1.2. 
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LS NDNDIND™N LS NTIN 
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Figure 1.2. Archimedes in motion: a long trip in short steps. 


The principle says that steps of size a, no matter how short, will eventually 
complete a trip of length b, no matter how long. As Archimedes (around 250 BCE) 
might have put it, a positive number a, no matter how small, is not “infinitesimal”: 
successive multiples a, 2a, 3a, ... will eventually exceed any proposed bound b. 


Proof: To show that N is unbounded, assume toward contradiction that no integer 
n exceeds MM. Then / is an upper bound for N, and so, by the completeness 
axiom, N must have a real supremum, say (3. Because /3 is the least upper bound, 
@ — 1 is not an upper bound, and so no > 3 — 1 must hold for some positive 
integer no. But then no + 1 > £, which contradicts our assumption that no 
integer exceeds (3. Thus N is unbounded, as claimed. 

The remaining claims turn out to be lightly disguised versions of the result 
just shown. For the Archimedean principle, note that, for positive a and 0, 


b 
na>b = n>-. 
a 


But we’ve just shown that N is unbounded, so the right-hand inequality must hold 
for sufficiently large n. 
Proof of the “squeezing-in” principle is left as an exercise. 


Nested Intervals 


A collection of intervals [,, [2, [3,... is called nested if 


hOB BS hye 


Here is a nested collection of open intervals: 


(0,1) 2 (0.5) > (0.5) 5 (0.5) ae 


Observe that, although each interval J,, contains infinitely many points, the inter- 
section of the full collection is empty. 
The story is different for closed intervals. The nested collection 


(0,1) 2 0.5 > 0.5 > 0.5 Ses 
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Recall: Every nonempty finite 
set of reals has a maximum 
element. 
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has exactly one point of intersection—the number 0. The following theorem de- 
scribes the situation in general; completeness is the key. 


Theorem 1.31 (The nested intervals theorem). Consider a nested infinite col- 
lection 
Ih Dlg Dlg DIy2D... 


of closed and bounded intervals. The intersection 
io Tes (iene: 


contains at least one point. If the intervals’ lengths shrink to zero, then the inter- 
section is a single point. 


Proof (sketch): We sketch the main idea, leaving some details to exercises. If we 
write I, = [an, b,,] for all positive integers n, then the nesting condition means 


ay <ag < ag <-++ < b3 < bg < by. 


In particular, the “left endpoint set” A = {a1, a2, a3,...} is bounded above. By 
completeness, we can set a = sup A, and it is readily shown that a lies in all the 
I,,. A similar argument shows that 3 = inf{b,, be, b3,...} lies in all the [,. If 
the lengths of the J,, shrink to zero, then a = 7. 


Rationals and Irrationals: Tightly Packed 


We mentioned casually at the beginning of this section that between any two given 
rational numbers lie infinitely many other rational numbers. A little more is true, 
and now we can prove it. 


Theorem 1.32. Let x and y be any two real numbers, with x < y. 
(a) The interval (x, y) contains at least one rational and one irrational number. 


(b) The interval (x,y) contains infinitely many rationals and infinitely many 
irrationals. 


Proof: Claim (b) looks much stronger than (a), but showing that (a) implies (b) 
is surprisingly easy. Let’s suppose, toward contradiction, that (a) holds but (a, y) 
contains only finitely many rationals 71, rg, ..., Tn. Then one of these, say rn, 
is largest. But then the interval (r,,, y) must contain no rationals, which contra- 
dicts (a). Exactly the same proof applies to irrational numbers, so we conclude 
that (a) implies (b). 

To prove (a) we’ll use Theorem 1.30. For convenience, we’ll handle here only 
the special case 0 < x < y, and mop up remaining cases as exercises. If we 
set € = y — @, then the squeezing-in part of Theorem 1.30 says that for some 
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integer n the rational number r = 1/n lies in the interval (0, €)—as does the 
irrational number p = r/V2. 

To finish the proof, we show that at least one of r, 2r, 3r, 4r, ... (all are 
rational) and at least one of p, 2p, 3p, 4p, ... (all are irrational) lie inside (2, y). 
This is intuitively reasonable—for both sequences, the “jumps” are too small to 
miss (x, y) entirely. To put it formally, say for p, 2p, 3p, 4p,..., consider the 
set S = {np | n € Nandnp < y}. By the Archimedean principle, np > y 
for sufficiently large n, which amounts to saying that S is finite. Hence S has a 
largest element, say nop, with nop < y but (no + 1)p > y. But now 


(ntlp>y = nmply-—p>y—ly—2) =, 


and so x < nop < y, as desired. The proof that x < mgr < y for some positive 
integer mp is almost identical. 


Exercises 


1. We said in this section that completeness of R guarantees that the number 
two has a real square root. Let’s prove it. 


(a) Let S = {x | x? < 2}. Explain why S has a least upper bound; call 
it GB. 

(b) We’d like to show that 8? = 2. First suppose toward contradiction 
that 3? < 2, and set h = (2 — 6?)/5. Show that (3 + h)? < 2, and 
derive a contradiction. 


(c) Prove by contradiction that 3? > 2 is also impossible. (Hint: Set 
k = (6? — 2)/4 and consider (3 — k.) 


2. Use the completeness axiom for sups to prove the analogous result for infs: 
if S C R, S # OG, and S is bounded below, then there is a € R such 
that a = inf(S'). (Hint: Given a nonempty set S, look at the new set —S' 
defined by —S = {—s | s € S}. Apply the completeness axiom to —S and 
interpret the result.) 

3. The list 4, 3, i, we 2 i ... suggests an explicit “recipe” for an endless 
collection of rational numbers between 0 and 1. 


(a) Give a similar recipe for an endless list of rationals between 0 and 
1 
507° 
(b) Give a similar recipe for an endless list of rationals between 4 and 3. 


(c) Let a and b be any two rationals, with a < b. Give a recipe (involving 
a and b) for an endless list of rationals between a and b. 


Note thatO0 << p<r<y-x. 
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. Use the unboundedness of N (the first part of Theorem 1.30) to prove the 


squeezing-in principle (the second part of Theorem 1.30). 


. Our statement of the Archimedean principle (part of Theorem 1.30) in- 


cludes the hypothesis 0 < a < b. 


(a) Is the claim true if 0 < b < a? Give a proof or counterexample. 


(b) Is the Archimedean claim true or false if we replace the hypothesis 
0<a< bwitha < b? What if0 4a < b? 


. Part (a) of Theorem 1.32 says that every interval (x, y) contains at least one 


rational and one irrational. The proof given above assumed that 0 < x < y. 


(a) Show that the same result holds if z < 0 < y. (Hint: Apply the result 
already proved to the new interval (X,Y) = (0, y).) 


(b) Show that the result still holds if 7 < y < 0. (Hint: Look at the new 
interval (X,Y) = (—y, —2).) 


. The completeness axiom says this: if.S C R is bounded above, then S has 


a supremum (3, and 3 ER. 
(a) Is the italicized statement true or false if R is replaced (in both places) 
by Z? Explain. 


(b) Is the italicized statement true or false if IR is replaced (in both places) 
by Q? Explain. 


. In each case, find a nested collection J; D In D ... of intervals as 


described. 


(a) Each I, is open, the J, are nested, and the intersection is the single 
point 3. 


(b) Each I, is closed, no two J, are equal, and the intersection is the 
interval [—3, 42]. 


(c) Each J, is open, no two J, are equal, and the intersection is the inter- 
val [—3, 42]. 


9. This problem is about some details in the proof of Theorem 1.31, page 68; 


see the notation there. 


(a) Prove that, for all m and n, am < by. 
(b) Prove that a < b,, for all n. 
(c) Prove that, for alln, an <a < 9 < by. 
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10. It is well known that every number has an infinite decimal expansion. For 
instance, we can write 


: = 0.3333333...; am = 3.1415926...; 42 = 42.0000000.... 


Here is a slightly subtler fact: every infinite string of decimal digits 
corresponds to a unique real number 3. Use the completeness axiom to 
explain why. (Hint: For convenience, consider only strings of the form 
0.d,dzgd3dads ...., where each d; is a decimal digit. Use these digits to 
construct a bounded set with supremum /3.) 
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Sequences and Series 


2.1 Sequences and Convergence 


An infinite sequence is a list a1, a2, a3,... of real numbers, indexed by the posi- 
tive integers. Here are some examples: 


111 1 

Q1, G2, 43, G4,.--, G42,--. = lroog Too 

bi, be, b3, b4,..., ba2,.-- = 1,-1,1,-1,...,—-1, ; 

C1, C2, C3, C4.--, Cag,--. =  Sinl, sin2, sin3, sind, ..., sin42,...; 
ti, fo, fs, fa, ---> faa,--. = 1,1, 2,3, ..., 267914296,...; 

123 5 433494437 

Gis 92, 93; Gay +++) G4dy--. = T? 1’ 2? 3°°°°? 967014006 
Observe: 


© Words and notations: A particular entry a,, in a sequence is called its nth 
term. Notice that the subscript, or index variable name, is arbitrary: a, and 
a; and even a, can all mean the same thing. For the entire sequence we 
write {a,,}°°.,, orjust {a,, }; the braces emphasize the fact that a sequence 
is a set. 


Sequences as functions: A sequence {a,,} is, among other things, a real- 
valued function, with domain N. Writing, say, a(42) rather than a4 can 
help emphasize this viewpoint. 


© Sequence rules: Like other functions, sequences are often defined by sym- 
bolic rules. Among the sequences above, for example, we have the rules 


Qa, =1f/n, bp =(-1)"", and ¢,=sinn. 


Not every sequence (or function, for that matter) has such a simple recipe. 
The sequence { f,,} above is the famous Fibonacci sequence, which is de- 
fined recursively: 


figHly fe=1; fa=fr-ot faa ine 3. 


The sequence {g,,} can be built (can you guess how?) from { f,,}. 
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The ay notation is legal but 
unwise, since x usually denotes 
a “continuous” variable. 
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Calculating some decimal 
values might help us guess. 


2. Sequences and Series 


* Converge or diverge? It is clear at a glance that {a,,} converges to the limit 
zero. Sequences {b,,} and {f,,} presumably diverge, because the former 
never settles on any single limit, while the latter blows up in size. Precise 
definitions, which we’ll give in a moment, will formalize these intuitions 
and lead to rigorous proofs. Whether {c,,} and {g,,} converge or diverge 
may be harder to guess. 


Convergence and Divergence 


Informally speaking, a sequence {a,,} converges to a limit, say 3, if the numbers 
Gy, “approach” 3 as n “tends to infinity.’ Such an intuitive view can be useful, 
but it’s too vague to permit careful proofs. The words in quotes are especially 
important—but especially unclear as written. We need a formal definition: 


Definition 2.1. Let {a,,} be a sequence and L a number. We say {ay} converges 
to L if for every e > 0 there exists a number JN so that 


la, — L|<e whenevern > N. 


In this case L is the limit, and we write limy—. Gn = L, or just a, — L. If no 
such L exists, we say {ay} diverges. 


The definition is subtle, and deserves some unpacking: 


* Greek letters: The Greek letter « (lower-case epsilon) is traditionally used 
in mathematics to denote small positive quantities. Later we’ Il use 6 (lower- 
case delta), as well, for similar purposes. 


* © measures “nearness”: The inequality |a, — Z| < e€ means that a,, is 
within € of L. If € is very small then a,, is very near to L. 


¢ N measures “waiting time”: The definition requires that every term a, with 
n > N be within « of L. The number N says how long we need to wait 
for this happy outcome. If, say, € = 0.001 and N = 234.7, then we’re 
guaranteed that a235, G36, @237, and all following terms are within 0.001 
of the limit. 


¢ Symbolic shorthand: The heart of the definition can be written compactly 
in symbols: 


Ve >0 INEN suchthat n>N => |a,—-L|<e 


Using the definition: notes on proofs. Our concise definition can lead, ideally, 
to equally concise proofs. But such polished products can seem both impressive 
and mysterious, like a Ferrari with the hood up. To raise the hood a bit, we’ ll 
often precede formal proofs with informal discussion—but keep the two separate. 
We start with some positive and negative examples. 


2.1. Sequences and Convergence 


EXAMPLE 1. Prove that the sequence {a,,} with a, = 4 converges to zero. In 
symbols, limy—oo + = 0. 


DISCUSSION. The fact that a, — 0 seems obvious; how could it be otherwise? 
To invoke the definition, we work with the desired inequality, |a,, — L| < ¢. Here 


we have 
1 


n 


1 1 1 
=-, and —<e = n>-. 
n n € 


lan — L| = 


(The first calculation works because—but only because—both n and € are posi- 
tive.) Now we’re getting somewhere: N = 1/e does what the definition requires, 
and we’re ready for a slick and seamless proof. 


PROOF. Lete > 0 be given. Set N = 1/e. This N “works” because if n > N, 


then 
1 


n 


1 — 
N 


which is what the definition requires. ©) 


1 
la, — L| = =-—< 
n 


EXAMPLE 2. Let a, = (3n 4+ 2)/(n +5). Show that {x,,} converges. 


DISCUSSION. First we need a candidate for L; a little thought or plugging in 
large numbers suggests L = 3. As in Example 1, we manipulate the desired 
inequality in search of a suitable N. This time we need a little more algebra: 


ene 3n+2 4 _ [8nt+2—3(n+5)/ | -13] 18 
i ~ | n+t5 - n+5 ~In+5) n+5 
and 
13 


13 13 
<e = —<nt+d = —-5K<n. 
n+5 € € 


So N = 8 — 5 does the job, and we’re ready for another brief proof. 


PRooF. Let « > 0 be given. Set N = 8 — 5. This N “works” because if 


n > N, then (omitting some routine algebra in the second equality) we have 


| | 3n + 2 3 13 e 13 13 
wn a — _ SS. ————S a Es 
n+5 n+5 ~N+5 18-545 
just as the definition requires. © 
EXAMPLE 3. Show that the oscillating sequence by, ba, b3, b4,--- = 1, —1, 1, 


—1,... diverges. 


Clearing out pesky absolute 
value bars is a big help. 


Again the annoying absolute 
value eventually vanishes. 
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Proving this is another matter. 


2. Sequences and Series 


DISCUSSION. As in Example | the claim is hardly surprising. The new twist is 
to prove a negative: No number L works in the definition. One approach involves 
the fact that no single number L can be close (within 0.01, say) to both 1 and —1. 


PRoor. Assume, toward contradiction, that {b,,} converges to any number L. 
Let € = 0.01 and choose WN as in the definition. If no is any odd integer with 
no > N, then we have 


lbn, — L| = |1-—L| < 0.01 => Le (0.99, 1.01). 
Similarly, ifn is an even integer with n, > N, then 
lb,, — L] = |-1-—L| < 0.01 => Le (—1.01, —0.99). 


Thus L lies in two disjoint intervals, which is absurd. ©) 


Seeing Sequences and Convergence 


Like ordinary graphs in a calculus course, geometric views of sequences and con- 
vergence help us visualize their behavior—good and bad. Different views are 
possible. 


Points on a line. Selected terms of a sequence {a,,} can be plotted as labeled 
points on a line. The resulting diagrams are crude and limited, but they can illumi- 
nate properties of sequences and convergence. Figure 2.1, for example, illustrates 
the e—N condition for convergence: All terms a,, with n > N lie within ¢ of L. 


Points in a plane. Terms of {a,,} can also be viewed as points (n, a») on the 
graph of the function a : N — R. Figure 2.2 offers glimpses of two sequences, 
{1/n} and {sin}. One converges to zero (as we’ve proved!); the other looks un- 
likely to converge to anything. Such graphs can reveal a lot, but never everything— 
a sequence has infinitely many terms, and a picture shows only a tiny sample. 
Sequence graphs can also show how « and N interact. The idea is to express 
the key definition in graphical language: for any « > 0, no matter how small, 
there is some N on the (horizontal) n-axis so that all graph points fo the right of 
N lie inside an “e-band” around the horizontal line y = L, as Figure 2.3 suggests. 


a, a, a, A535 Ay38 ay net) yt) a, a, 
Se ——!. 
L-e E L+e 


Figure 2.1. Convergence: a one-dimensional view. 


2.1. Sequences and Convergence 


10 20 30 40 50 


(a) A look at the sequence {1/n} 


10F 


0.5 


-0.5F 


(b) A look at the sequence {sin n} 


Figure 2.2. Graphical views of two sequences. 
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Figure 2.3. What convergence means graphically. 


Properties of Sequences 


Sequences, like other real-valued functions, may or may not have various behav- 


ioral properties. Here are some typical definitions: 


Definition 2.2. Let {a,,} be a sequence of real numbers. 


* Boundedness: {a,,} is bounded above if there exists M such that a,, << 
for all n € N; it is bounded below if there exists m such that a, > m for 


all n. 


* Monotonicity: {ay} is increasing if dn < Qn41 for alln € N; {ay} is 
decreasing if Gy > Gn+41 for alln € N. {a,} is monotone if it is either 


increasing or decreasing. 


® Strictness: {a,} is strictly increasing if an, < Gn+41 for alln € N. {ay} is 


strictly decreasing if dn > Qn+41 foralln € N. 
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Nice to see it turn up again. 


We could play the same game 
with smaller «, but there’s no 
need. 


In Example 2, we found N= 
13/e -—5. 


2. Sequences and Series 


The sequence {1/n}, for example, is strictly decreasing and bounded (above and 
below), while the Fibonacci sequence is increasing and unbounded (above). 

The following two theorems link these properties to convergence and diver- 
gence. Neither statement is surprising, but the formal proofs are nice exercises— 
and left as exercises—in using the definition. Some pre-proof discussion follows 
each result. 


Theorem 2.3. If a sequence {a,,} is (i) monotone, and (ii) bounded, then {ay} 
converges. 


Almost-a-proof discussion. Let’s assume that {a,,} is increasing; a similar proof 
works for the decreasing case. Since the set {a,,} is bounded, it has a supremum, 
by the completeness axiom. Let’s call the supremum L, and show that L is the 
limit, too. To this end, let « > 0 be given; we’re done if we find any N that 
“works” for this €. 

Here’s the trick. Because L is the supremum, L — e€ is not an upper bound 
for {a,,}, and so there must be some term, ay, with L — € < ay. The punchline 
is that this N “works”: If n > N, then a, > ay because {a,,} is increasing; 
therefore, we have 

DLD-€<an <a,<LD<L+e, 


as desired. 


Theorem 2.4. If a sequence {ay} converges, then {ay} is bounded both above 
and below. 


Discussion. Suppose {a,,} converges to L. If we set « = 1, then—by the defini- 
tion—there is some N such that D—1 < a, < L+1 forall n > N. In particular, 
the “upper tail” {anv41,@n+42,4N+3,---} is bounded (above by L + 1 and below 
by LZ — 1). Now the “front end” {a1,a2,...,an} is a finite set, and hence also 
bounded. Thus the entire sequence {a,,} is the union of two bounded sets, and is 
therefore bounded. 


Beyond the basics: inequalities. The sequences in Examples 1 and 2 led to 
relatively simple inequalities and, in turn, to relatively simple recipes for N in 
terms of €. Sequences with more complicated definitions may require more clev- 
erness. Fortunately, a little creativity with inequalities can dramatically simplify 
the work. Following are two examples. 


EXAMPLE 4. Show that the sequence {z,,} with 


1 
n+ JY/n4+14+5 


zn = 


converges to zero. 


2.1. Sequences and Convergence 


DISCUSSION. But for the annoying denominator {z,,} resembles {1/n}, which 
converges to zero (see Example 1). To show that z,, — 0, too, for given € > 0, 
we need to find N for which 


1 1 
SS -- ee oo KE 
| n+V¥n+1+5 


whenever n > N. This looks clumsy, but a simple inequality brings radical 
improvements: 


l2zn — O| = 


1 1 
Zn, — 0| = —————— _ < -. 
lan — 0| ntV¥n+1+5 =n 
Now it is easy to see that 
1 1 
-—-<€ Sno o-, 
n € 


which means that N = 1/e “works.” We assemble the parts in the concise proof. 


PROOF. Let « >0 be given. Set N = 4. Ifn > N then 


1 1 
—————— CX 
esl n+V¥n+1+5 


as the definition requires. ©) 


ei 
N 


1 
len — 0] = = < <i =e 
n 


EXAMPLE 5. Let wp, = 4n/(2n — 85); show that {w,,} converges to 2. 


DISCUSSION. This time the key quantity |w,, — L| takes the form 


4n 170 
jon — L| =|, 2] =| 
2n — 85 2n — 85 
we want 
fife nt 3 2 Ore 
io ~ |2n — 85 


to hold for large n. It would be nice to drop the absolute value—alas, the de- 
nominator, 2n — 85, may be negative. The good news is that this happens only 
for n < 42. If n > 42, then we can indeed drop the absolute value in good 
conscience, and solve our inequality without undue fuss: 


170 170 Be ee 170 ai 85 ee 
= —+—<n. 
In—85| In—85 ~~ de 2 
The last inequality is what we wanted—a value of n beyond which |w, — L| < e. 
Here comes the proof. 


Check each step; note the key 
inequality. 


Check the final calculation. 


But check all algebra carefully. 
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PROOF. Let e > 0 be given. Set N = +o + s. (Note that N > 42.) This N 
works, since ifn > N, then we have 


since n> 43 2 = 5 — =; ve 
n— n— 
170 170 


a nice collapse! Se 
QN—85 Wye5—e5 


as the definition requires. ©) 


Exercises 
1. If a sequence {x,,} converges, then for any given « > 0 there is some N 
that “works” in the sense of the definition. For the sequence {1/n} the 
following table shows values of N associated with values of e: 
e | 1.000 0.100 0.010 0.001 
N 1 10 100. ~=1000 


Make a similar table (same values of €) for each sequence following. Try 
to choose N as small as possible. 


(@) {4} 
© {Fe} 
© {rm} 


2. This problem is about e—N tables for convergent sequences, like those in 
the preceding problem (with « = 1,0.1,0.01, 0.001 in the first row). 


(a) Explain why all entries in the second row can be the same. 


(b) Consider the sequence {x,,} with x, = 0 for all n. What goes in the 
second row? 


(c) Consider the sequence {y,} with y, = 1/d(n), where d(n) is the 
number of base-ten digits in n. (For example, y123456 = 1/6.) What 
goes in the second row? 


3. Suppose that {,,} converges to 5. 


(a) Show that there is some N such that x, € (4.9,5.1) for alln > N. 
(b) Show that there is some N such that x7, > 4.999 for alln > N. 
(c) Show that the set {x,, | 2, > 6} is finite. 
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4. Suppose that {x,,} does not converge. 
(a) Is it possible that x,, = 3 for infinitely many n? If so, give an example. 
If not, explain why not. 
(b) Is it possible that x, = 3 for all but finitely many n? If so, give an 
example. If not, explain why not. 
5. Suppose that {x,,} converges to 4. 
(a) Is it possible that x, # 4 for infinitely many n? If so, give an example. 
If not, explain why not. 
(b) Is it possible that z,, > 4 for infinitely many n and x, < 4 for 


infinitely many n? If so, give an example. If not, explain why not. 


6. Suppose that {,, } is unbounded above. Use the definition of convergence 
to show that {x,,} does not converge to L = 1000. 


7. Prove that each of the following sequences converges. (Example 2, page 75, 


is similar.) 
2 

(a) {an}, with an = woe 

2n + 300000 
b) {b,,}, with b,, = —————_— 
Dae A 3n+5 

: 2n 

(c) {cen}, with c, = ane EB 


8. Guess and then prove a limit for each sequence following. (Examples 4 and 5 


may be helpful.) 
2n 

nS ith a 
(a) {an}, with a oe 
(b) {b,}, with by, = an 

we " 3n+sinn+5° 

: n  2n 

(c) {cn}, with c, = (—1) ante 5 


9. Convert the informal discussion after Theorem 2.3, page 78, into a concise 
proof that a bounded, decreasing sequence converges. 


10. Convert the informal discussion after Theorem 2.4, page 78, into a concise 
proof. 
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11. In this section we mentioned the Fibonacci sequence { f,,}, defined by f; = 
fo =1and fr = frn—2+ fn—1 for n > 3. Itis clear that { f,,} is unbounded, 
but how fast does { f,,} increase? We explore this question in this problem. 
Let’s show first, by induction, that f, < 2” forn > 1. Because the Fi- 
bonacci sequence is defined using two previous terms, it is convenient to 
take both n = 1 and n = 2 as base cases. 

The claim is obvious form = 1 and n = 2. For the inductive step, we 


suppose the claim holds for all n up through & (with & > 2) and show, as 
follows, that it holds forn =k +1: 


fia = fei + fe < 2 1 +O" < oP oF = oO, 
as desired. 


(a) Show by induction that f,, > 1.5” forall n > 11. (It is easy to check 
with technology that the inequality is false for smaller n.) 


(b) It can also be shown by induction (as in the preceding problem) that 
fn > 1.6” for all “sufficiently large” n. Use technology to decide 
which n have this property. 


12. Consider the sequence {g,,} defined by g; = 1 and gn414 =1+4+ oan 


(a) Write out the first few terms of the sequence to see the Fibonacci 
numbers pop up. 


(b) Prove by induction that g,, < 2 for all n. 


(c) Observe (with technology) that {g,,} appears to hop back and forth 
across the number @ = (1+/5)/2 ~ 1.618 (this is the famous golden 
ratio). Prove this by showing (algebraically; no induction needed) that 
(i) if g, > @then gn4y1 < ¢; and (ii) if gn < d then gn+1 > od. 


13. Here is a second, “unofficial” definition of convergence to a number L: 
{rp} converges to L if, for every € > 0, we have |x, — L| < ¢ for all but 
finitely many n. 


(a) Show that {1/n} converges to zero in the sense above. 


(b) Show that if x, — 0 in the sense above, then x,, — 0 in the “official” 
sense defined in this section. 


(c) What does it mean for a sequence {,, } not to converge to zero in the 
sense above? (Negate the unofficial definition.) 


(d) What does it mean for a sequence {x,, } not to converge to zero in the 
official sense? (Negate the official definition.) 


2.2. Working with Sequences 


14. 


From any given sequence {z,,} we can form the related sequence {y,,} = 
{52,+2}. Use the definition of convergence to show that if {x,, } converges 
to 42, then {y,,} converges to _____.. (First fill in the blank.) 


. Some sequences {2 } have e—N tables (in the sense of Exercise 1, above) 


of the following form: 


e | 1.0 0.1 0.01 0.001 0.0001 0.00001 


Which sequences are these? 


. Some sequences {2 } have e—N tables (in the sense of Exercise 1, above) 


of the following form: 


e | 1.0 0.1 0.01 0.001 0.0001 0.00001 


Which sequences are these? 


. Show that every real number ( is the limit of an increasing sequence of 


rational numbers. (Hint: Use the idea of infinite decimal expansion; see 
Exercise 10, page 71.) 


2.2 Working with Sequences 


New Sequences from Old 


As with other functions, “old” sequences can be combined in various ways to 
form “new” ones. Naturally enough, convergence and divergence properties of a 
built-up sequence reflect properties of the building blocks. 


Theorem 2.5 (Algebra with convergent sequences). Let {a,,} and {b,,} be con- 
vergent sequences. Let a, b, and c be real numbers, with ay, — a and b,, — b. 


* Sums and differences: The sequence {ay + bp} converges to a + b. 


* Constant multiples: The sequence {cay} converges to ca. 


® Products: The sequence {dy - by} converges to a- b. 


* Quotients: If b 4 0 and b, # 0 for all n, then the sequence {ay/bn} 


converges to a/b. 


Let’s use the theorem first; proofs come later. 
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Find your own example where 
{an + bn} diverges. 


2. Sequences and Series 


EXAMPLE 1. In Example 2, page 75, we used the definition of convergence to 


show that { me } converges to three. Could Theorem 2.5 have helped? 


SOLUTION. Yes. Basic algebra gives 


8n+2  34+2/n  3:142-4 
n+5  14+5/n  14+5-2 


> 


which shows the original sequence as a combination of the much simpler se- 
quences {1}, and {1/n}, which converge to 1 and 0, respectively. Now Theo- 
rem 2.5 implies that 


3-14+2-4 3-142-0 5 
—S —— SC 
1+5-+ 1+5-0 ; 


as expected. 

We just assumed, of course, that {1} and {1/n} do indeed converge to one 
and zero. These claims need proof, too, but the proofs are easy—and need to be 
done just once. ©) 


EXAMPLE 2. Let {a,} and {b,,} be divergent sequences, and let {cp} be a 
convergent sequence. Can the sum {a,, + b,,} converge? Can it diverge? What 
about {a, + cn}? 


SOLUTION. Theorem 2.5 says nothing about sums of divergent sequences. But 
easy examples show that {a,,} and {b,,} can either converge or diverge. With 
Gy, = nand b, = 7 — n, for example, we get a, +b, = 7 for all n, so {an + bn} 
converges. 

Theorem 2.5 does help with {an + Cn}; let’s call it {d,,} for the moment. If 
{d,,} were convergent, then the difference {d, — cy} = {a} would converge, 
too, contradicting our assumption. Thus, {a,, + c,} diverges. © 


EXAMPLE 3. A sequence {s,, } is defined recursively by 


Sn—-1 


= 2.0 d sn, = 
S] an Ss 5) rar 


The first few terms (rounded) are 2.0, 1.5, 1.41667, 1.41422, 1.41421, .... What 
is happening here? Why? 


2.2. Working with Sequences 


SOLUTION. The numbers suggest convergence to \/2, but can we be certain? 
The answer is yes—if we invoke appropriate theorems. The definition implies 
that s,, > 0 for all n. It’s also true (and proved in exercises) that {s,, } is decreas- 
ing, and so Theorem 2.3, page 78, implies that {s,,} converges to some limit L. 
Invoking different parts of Theorem 2.5 lets us find L. Because s,, — L, we know 


Sn-1 L 1 
n— = L=- >> 
7 es aie 
which implies, in turn, that L?=2,orL= v2. >) 


Proving Theorem 2.5. Theorem 2.5 contains few surprises. Concerning sums, 
for instance, it should seem reasonable that if a, ~ a and b,, = b for large n, then 
also dy + by, & a+ b for large n. Rigorous proofs are another matter, of course. 

The definition of convergence is all about inequalities; therefore, so are con- 
vergence proofs. Here, we’ll need to show that inequalities like 


(Qn + bn) —(a@+6)| <€, |anbn —(a+b)|<e, and <e€ 


An 
bn 


ae) 


hold for large n. What we have to work with are 
lan —al<e and |b, —b| <e, 


which do hold for large n, by hypothesis. The trick is to parlay these simpler 
inequalities into proofs of their more complicated cousins. The triangle inequality 
will come in very handy; watch for it in the following proof. 


Proof (for sums): Let € > 0; we need to choose N so that |(@, +bn) — (a+b)| < 
€ whenever n > N. Set <’ = €/2; note e’ > 0. Because a, — a, there is a number 
N, that “works” for e’: 


n>N, = |an-—al<e. 
Similarly, there exists N2 such that 
n>Ny => |bn—-dl <eé. 
Now let N be the larger of N; and No. Ifn > N, then bothn > N; andn > No, 
and so 
(an + bn) — (a + b)| = |(an — a) + (bn — 5) 
< lan —alt+ |b, —b] << e+e =e, 


where we used the triangle inequality at the line break. This shows that N works 
in the desired sense, and completes the proof. 
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L = —V2is clearly impossible 
since all Sp are positive. 


Watch for a technical trick, too. 
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We'll see this again. 


And left as an exercise. 


2. Sequences and Series 


Working with products. To prove our claim about products we will use the cor- 
responding properties for sums (just proved) and for constant multiples (left as an 
exercise). The proof starts with a standard analyst’s trick: subtracting and adding 
the same quantity: 


An Dn = Anby — Anb + anb = an(bp — b) + and. 


The result for constant multiples shows that a,b — ab; we'll be done if we can 
also show that a,,(b, — b) — 0. The key insight, as we will see in the formal 
proof, is that the {a,,} are bounded. 


Proof (for products): First we write anbn = An(by — b) + and. By the result for 
sums, it’s enough to show that 


anb—> ab and ay(bpn — 6) > 0. 


That a,,b — ab follows from the property of constant multiples that we assumed. 
To show a,,(b, — b) — 0, observe first that the convergent sequence {a,} is 
bounded (Theorem 2.4, page 78, says so); in other words, there exists IZ > 0 
with |a,,| < M for all n. Now let € > 0 be given; note that «/M/ > 0, too. Since 
bn — 6, there exists N such that |b, — b| < ¢/M when n > N. This N works 
for {an(bn — b)}, because if n > N, then 


€ 


|a@n(bn — 6) — O| = |an| [on — b| < M [bn — 6] < Ma 


— ¢, 


as desired. 


Quotients. To show a,,/b, — a/b requires—even to make sense—that b 4 0 
and b,, ~ 0 for all n. If we assume this, it is enough to show that 1/b,, > 1/0, for 
then the result for products gives a, -1/b, — a-1/b, as desired. We will outline 
a proof that 1/b,, > 1/b in the exercises. 


Squeezing. The algebraic methods of Theorem 2.5 don’t immediately help with 
the sequence {sinn/n}, which has a trigonometric ingredient. Still, we expect 
the sequence to converge to zero because the numerator remains tamely bounded, 
while the denominator “blows up.” More precisely, we know 


for all n. Both the left- and right-hand sequences converge to zero, and we expect 
them to “squeeze” the middle sequence to the same limit. That intuition is correct: 


Theorem 2.6 (The squeeze principle). Let {an}, {bn}, and {cn} be sequences 
such that an, < bp < Cp forall n. Ifa, — Land cy, — L, then by, — L, too. 


2.2. Working with Sequences 


Proof: For given € > 0, we need to find N such that n > N implies |b,,—L| < «, 
or, equivalently, D —e <b, < LD+e. 
Because a,, — L we can choose Nj, so 
n>N, = L-ex<a,<Lte. 
Similarly, there is Nz so 
n>No = L—-éeé<o<Lte. 


Now let N = max{N,, No}. This N works for {b,,}, since if n > N then 


D-—€<an <bn << Dt+e, 


as desired. 


Theorem 2.6 points to additional “squeeze-like” properties of sequences. 


Proposition 2.7 (More squeezing). Let {an}, {an}, and {by} be sequences and 
La number. 


(a) 2, -0 <=> |z,|— 0 
(b) If an — L, then |a,| — |L}. 
(c) If |an| — 0 and {by} is bounded, then aby, > 0. 


Proof: Claim (a), left as an exercise, is a basic application of the definition. 

Claims (b) and (c) follow from (a) by judicious squeezing. For (b), notice first 
that a,, — L — 0 by hypothesis, and so ja, — L| — 0, according to (a). The 
reverse triangle inequality now gives our squeezing inequality: 


0< flan! — [LI < lan — L| 


Since the right-hand sequence tends to zero, so must the middle one, as we aimed 
to show. 

For (c), we first use boundedness of {b,,} to choose B > 0 with |b,,| < B for 
all n. This leads to another squeezing inequality: 


0 < |anbn| < Blan|. 


Again, the left- and right-hand sequences tend to zero (the constant multiple B 
does no harm) and therefore squeeze the middle sequence to the same limit. 
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Here one sequence is constant. 
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The German Hilfsatz, or 
“helping sentence,” is more 
descriptive. 
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Sequences to order. Sequences with specified properties can often be made to 
order, as the following sample suggests. We’ll call it a lemma because such se- 
quences are often used in proofs of other results. (Lemmas are junior-grade theo- 
rems, normally used to prove something else.) 


Lemma 2.8. Let L be any number. There exist sequences {r,,} and {p,}, both 
converging to L, with r, € Qand p, ¢ Qfor all n. If desired, {r,} and {pn} 
and can be chosen to be either strictly increasing or strictly decreasing. 


About proofs. If L is rational, then the sequences defined by 


v2 


1 
Tm =L+—- and pp=Ll+— 
n n 


are strictly decreasing and converge to L. Other cases are left to the exercises. 


Divergence to Infinity 


Sequences like {n} and {—n3} clearly have no finite limit, but they misbehave 
less seriously than, say, the zig-zag sequence {1, —2,3, —4,...}. The following 
definition formalizes these ideas. 


Definition 2.9. A sequence {2,,} diverges to infinity if for every M > 0 there 
exists a number NV such that 


Zn > M_ whenevern > N. 
In this case, we write limp —.o9 Lp = 00, Or just Lp, — OV. 
Observe: 


© Divergence to —oo: A similar definition holds for divergence to —oo: For 
given M > 0 there must exist NV with x, < —M whenevern > N. 


° Big M, small e: For ordinary convergence we focus mainly on small posi- 
tive values of €; here the challenging values of M are large. 


* Convergence or divergence? Some authors refer to convergence to 00; we 
prefer to reserve “convergence” for finite limits. 


© Unboundedness and divergence to infinity: An unbounded sequence need 
not diverge to -too, as {1, —2,3, —4,...} illustrates. An unbounded mono- 
tone sequence, on the other hand, must diverge to -oo; see Example 5. 


* Zero or infinity? A sequence {x,,} of positive terms diverges to oo if and 
only if {1/2,,} converges to 0. (Proofs follow directly from the definitions; 
see the exercises.) A similar result holds for negative sequences. 


2.2. Working with Sequences 


Swept away: a comparison test. Here’s an unsurprising claim about comparing 
one sequence to another: 


If a, < b,, for all n and a, — ov, then b,, — ox, too. 


(A similar claim holds for sequences tending to —oo.) Proofs follow easily from 
the definitions; see the exercises. The surprise is how powerful this straightfor- 
ward idea can be in applications. 


EXAMPLE 4. The sequences {h,,} and {s,} given by 


1 11 1 ieee Sligee se 1 
Soret ag ithes theta fe 


ae oe as Lo is Jn 


are positive and increasing. Do they diverge to infinity? 


SOLUTION. Ina word, yes. Observe first that s,, > hy, for all n, so it is enough 


to show that h,, — oo. One approach is to recall from elementary calculus that 


eter eae +i [S 1 
noertaitite te —=Inn. 

de Qe 3 n 1 & 
It is well known that Inn — oo, and so both {s,,} and {h,,}, being larger, are 
also “swept away” to oo. 


EXAMPLE 5. Prove concisely that if {,,} is unbounded and increasing, then 
Ln — OO. 


SOLUTION. Let M > 0 be given. Since {x,,} is unbounded, there exists N 
with zy > M. This N works in the definition: if n > N then, since {2,,} is 
increasing, we have x, > xn > M, as desired. © 


Exercises 


1. Prove the following part of Theorem 2.5, page 83: If a, — a and c is any 
constant, then ca, — ca. (Hint: The result is trivial if c = 0, so assume 


c#0.) 


2. Another part of Theorem 2.5, page 83, says that a,,/b, — a/bif b #4 0 and 
bn # 0 for all n. To complete the proof sketched in this section, we need to 
show that 1/b,, > 1/0. 


(a) Explain why lz -—-]= 7 lb, — OB}. 
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If using unproved calculus 
results seems like cheating, see 
the exercises for another 
approach. 
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(b) We will show just below that the sequence {1/|b,,b|} is bounded, say 
by M > 0. Assuming this for the moment, prove that 1/b,, > 1/0. 
(Hint: The preceding identity gives | + — +| < Mb, —b|. Now argue 


as in the proof for products of sequences.) 
(c) Show in two steps that {1/|b,.b|} is bounded. 


(i) Show that the set {b,,} is bounded away from zero—i.e., there 
is some 6 > O such that |b,| > 6 for all n. (Hints: Let « = 
|b|/2 > 0. Since b, — 6b there is an integer N such that 
|b, — b| < e when n > N; note that |b,,| > |b|/2 for all such n. 
The remaining set {|b1|, |b2|,..., |bw |} is a finite set of positive 
numbers, and so has a positive minimum. Use these facts to find 
a suitable value of 6.) 


(ii) Show that 1/|b,,b| < 2/(6|b|) for all n . 
3. Prove part (a) of Proposition 2.7: t, ~0 <= > |r| — 0. 


4. This problem is about Lemma 2.8, page 88. 


(a) What does the lemma say if L = /2? 
(b) Find a strictly decreasing sequence {p,,} with p, ¢ Q for all n and 


Dn > V2. 


(c) Describe how to find a strictly increasing sequence {r,,} with r,, € Q 
for all n andr, — J/2. 


5. Consider the sequence {s,,} in Example 3, page 84. Show by induction that 
{s,,} is strictly decreasing; i.e., show 5,41 < S$, foralln > 1. 


6. Consider the sequence {s,,} given by 


i. <a 1 
=o +t +—. 


sn = 7 a ii 


We showed in Example 4, page 89, that {s,,} diverges to infinity. Give 
another comparison proof, this time with the sequence {,/n}. 


7. Consider again the sequence {h,,} in Example 4, page 89. Observe that 


1 1 1 1 
| ee a Cee ran Rr eae ae 
2 Ta gg ee eS, T's 


Use this idea to prove (without calculus) that {h,, } diverges to infinity. 


2.2. Working with Sequences 


8. The sequence {.S,,} given by 


is clearly increasing, so it either converges or diverges to infinity. Which 
happens? To decide we work with the similar series {T,, } given by 


1 1 1 1 


T, = —— ——_—_—-. —————— see —_—. 
141° 242 32437 arama 


(a) Use technology to evaluate some terms of both {S,,} and {T;,,}. Do 
they appear to converge or diverge? 


(b) Guess a simple formula for T;,; prove your guess by induction. 
(c) Conclude from the preceding part that T;, — 1. 


(d) Prove that {.S,,} converges to some limit L < 2. (Hint: Show first 
that S,, < 2T,, for all n > 1.) 


9. Let {x} be a sequence of positive terms. 


(a) Prove that if x, — oo, then —x,, — —oo. (The converse is also true, 
and the proof is almost identical.) 


(b) Show that x, — oo if and only if 1/2, — 0. 


(c) State and prove a similar claim for sequences of negative terms. (Use 
the first two parts.) 


10. Suppose a, > 0 for all n and a, — 3. Show that ,/a, — J/3. (Hint: 
Show first by algebra that |,/a, — V’3| < Jan — 3}.) 


11. Use theorems from this section (not the definitions of convergence) to dis- 
cuss each of the following limits. It is OK to assume such basic facts as 
1/n — 0, but say what you’re assuming. 


(a) jenn 


5n + cosn 
{vn +1} 
(c) {Vn +n—n} 


n? + arctann 
n+ ~  n+2 


(d) 
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2.3 Subsequences 


Basic Ideas 


Any given sequence 21, %2,%3,%4,... has many, many subsequences: new infi- 
nite sequences, such as 


U1, U3, U5, U7,---, U1, V4, VQ, V16,---, and 123, 124, V125, V126, +--+; 
formed by choosing, in order, any infinite subset of the original x,,. The last 
example could be called an upper tail subsequence, formed simply by skipping 
over an initial string. Order matters, and repetitions aren’t allowed, so 


U2, U1, U4, U3, UE, U5, --- and U1, U1, U2, 2, U3, V3, .--- 


are not considered subsequences of the “parent” {x,, }. 
Subsequences may or may not behave much like their parents. If, say, 


U1, 2, 13, T4,°°+ = 1, 2,1, 2,..., 
then {2,,} has no single limit, and therefore diverges. But the subsequences 
%1,%3,%5,°°-=1,1,1,... and 2, %4, %,--- = 2, 2, 2,... 
obviously converge to 1 and 2, respectively. Still other subsequences, such as 
%3, U6, 9, T12,--. and 21, X10, L100, T1000, - - - 
may or may not converge. By contrast, the sequence {y,,} with 
Y1; Y2, Y3; Y4, °° = 1.0, 0.1, 0.01, 0.001, ... 

converges to zero—and so does every one of its subsequences, including 

Y1, Y3, Y5, ++. Y83,--- and Y1, Yio, Y100, ---5 Y1041, +++ 

Stranger examples are possible. 

EXAMPLE 1. (A walk along the rationals.) Because Q is countably infinite, it 


can be listed as a sequence 71, T2, 73, 74, .... Is there a subsequence consisting 
entirely of positive integers? Must some subsequence converge to zero? 


2.3. Subsequences 


SOLUTION. Yes and yes. Lacking rigorous definitions, we’ll argue (very!) in- 
formally. 

To find a subsequence of positive integers, imagine walking from left to right 
along the rational sequence, underlining each term that happens to be a positive 
integer. The result might look like this: 


T1, 72, 73, 74, 75, 76, --+5 723, T24, 725, +++, T1234, T1235, ---- 


The process never stops because the supply of positive integers is infinite, so we 
have our desired subsequence: 


12, 73,124,71235 +--+ 


To find a subsequence that converges to zero, recall first that for every positive 
€, no matter how small, the interval (—e, €) contains infinitely many rationals. To 
find our desired subsequence, therefore, we walk again from left to right along the 
original sequence. At some position, say a9, we find ag € (—1,1). Continuing 
our rightward walk, we find, say, 


a7 € (—0.1, 0.1), 137 € (—0.01,0.01), 2988 © (—0.001, 0.001), 


and so on. We never need to stop because, at any stage, we’ve left behind only 
finitely many of the infinite family of residents of (—e, €). ©) 


Similar reasoning shows that any listing {r,, } of Q has subsequences consist- 
ing entirely of prime numbers or entirely of fractions with denominator 424242. 
It is less obvious—but true—that {r,,} has an increasing subsequence with limit 
7m, a decreasing subsequence with limit e, a decreasing subsequence of integers 
diverging to —oo, and countless other subsequences of interest. We sort out such 
possibilities in this section. 


Formalities. The idea of subsequences is simple enough but the notation takes 
some getting used to. To create a subsequence from a given sequence {a,,} means 
to choose a strictly increasing sequence of subscripts 


ny<ng<ng <M <N5<..., 
and use them to form the subsequence 


any» Ane» Ans» ana, ans» eee 


If, say, n1 4, n2 7,n3 11, 14 29,..., then the subsequence has the 
form 


G4, 47, G11, 429, ..-- 


The entire subsequence is denoted {a,,, }; note the double subscript. 


Coming soon. 
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A bit laboriously. 
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The process can be described precisely in the language of functions. Recall, 
first, that a sequence {a,,} is a function a : N > R, with a(1) = ay, a(2) = ag, 
etc. From this viewpoint a subsequence is formed by composition, as the formal 
definition says: 


Definition 2.10 (Subsequence). Let {a,,} be a sequence, regarded as a function 
a:N—R. A subsequence {ay,, } of {a,} is a composite functionaon : N > R, 
where n : N — N is any strictly increasing function. 


Observe: 


° A useful inequality: The definition implies a simple inequality that is use- 
ful in proofs: nz, > k for all k. A formal proof might involve induction 
or the pigeonhole principle, but the idea is mainly common sense—the ‘th 
term of a subsequence can’t appear before the kth term of the parent se- 


quence. 

© Another subsequence: The subsequence indices 1, n2, 3, N4,... for 
{an, } form still another subsequence—a strictly increasing subsequence 
of {1, 2,3, ...}. 


EXAMPLE 2. Let {a,,} be a sequence. Interpret the subsequences 
G1, 43, 45, @7,... and doa, G4, a6, Gg, ... 


in functional language and notation. What can be said about {a,,} if both subse- 
quences converge to three? 


SOLUTION. For the “odd” subsequence we have n(1) = 1, n(2) = 3, and 
n(42) = 83; in general, n(k) = n, = 2k — 1. Thus, 


aon(k) = a(n(k)) = a(2k — 1) 


for all k. In ordinary sequence notation, we can write {a,,,} = {a@2~—-1}; notice 
that k, not n, is now the index variable. For the “even” subsequence similar 
reasoning gives {an} = {aox}. 

If both subsequences {a2,—1} and {a2,,} converge to three, then it is reason- 
able to expect the parent sequence {a,,} to do so as well. To prove this let « > 0 
be given. By hypothesis, there exist numbers /(; and [¢2 such that 


k>oky => |ad2n-1 — 3| <€ and k>Ko => laor — 3| <e. 


Now it turns out that V = max {21, 22} “works” for € in the parent sequence. 


2.3. Subsequences 


To see why, suppose n > N. If n = 2k happens to be even, then n = 2k > 
2K 2, and so k > Ko, which implies that 


lan — 3] = laa, — 3] <e. 
If n = 2k — 1 is odd, thenn = 2k —1 > 2K ,andsok > Kj, and we see 
lan — 3| = |aan-1 — 3] <e. 


Thus we have |a,, — 3| < € forall n > N, and the proof is complete. ©) 


Properties of Subsequences 


Every infinite sequence {2 } has countless subsequences. What can be said about 


such an enormous set? The answer is, “quite a lot’: subsequences often “inherit” 
their parents’ basic properties. 


Theorem 2.11. Let {x,,} be a sequence and L a number. 


(a) If {xn} converges to L, then every subsequence {2y,,} converges to L, too. 


(b) If {a} diverges to +00, then every subsequence {Xn,,} diverges to +00, 
too. 


(c) If {xn} has subsequences converging to different limits, then {x,,} di- 
verges. 


About proofs. Key to both proofs is the fact, mentioned above, that, for any sub- 
sequence, n;, > k for all k. To prove (a), let € > 0 be given. By hypothesis there 
exists NV that works for {x,,} in the sense that |x, — L| < ¢ whenever n > N. 
The key fact above implies that the same N works for {%»,,}; if k > N then 
nz >k > N,and so |rp, — L| < ¢, as desired. 

The proof of (b) is similar, and (c) follows immediately from (a). Further 
details are left to exercises. 


Monotone subsequences. Following is a striking property of every sequence; 
we’ll make good use of it below in proving this section’s marquee result, the 
celebrated Bolzano—Weierstrass theorem. 


Proposition 2.12. Every sequence has a monotone subsequence. 


Proposition 2.12 is easy to state, but it’s a bit tricky to prove anything about all 
sequences—convergent or divergent, bounded or unbounded, tame or wild. Here, 
for instance, is a randomly-chosen stretch (from n = 125 to n = 130) of terms 
from the sequence {n cos(n7) }: 


..., 35.617, —4.847, 126.746, —106.052, —128.983, —25.531, .... 


Uncountably many, in fact. 


“Nontrivial,” in math-speak. 
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Do you see why? 


Do you see why? 
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It is hard to see much pattern there, but Proposition 2.12 guarantees that some- 
where in there is an increasing subsequence, a decreasing subsequence, or maybe 
both. 

We'll sneak up on a proof of Proposition 2.12 through two technical lemmas 
about special cases. 


Lemma 2.13. Every unbounded sequence {x»,} has a monotone subsequence 
that diverges to Loo. 


Proof (sketch): For a sequence {x,,} unbounded above, we’ll find an increasing 
sequence {zy } with x,, > k for all k. Since {x,,} is unbounded above, we can 
choose n1 SO Z,, > 1. Now the “upper tail” 


Xny+15 Xn +2) In +3) In +4) ais 


is also unbounded above, and so we can choose ng with ng > ny, and zr, > 
max{2,2,,}. Then we choose ng > nz with xn, > max{3, Xp, }. Continuing 
this process produces the desired subsequence. A similar construction produces a 
strictly decreasing subsequence if {2,, } is unbounded below. 


Lemma 2.14. Let {x,,} be a bounded sequence, with infimum a and supremum (3. 
Ifa €¢ {xy}, then there is a decreasing subsequence {%p,} with in, > a. If 
B € {xn}, then there is an increasing subsequence {Xp} with Xp, — P. 


Proof (sketch): To save a little labor we'll use Lemma 2.13 and some fancy alge- 
bra. In the case 6 ¢ {x,,}, we define a new sequence {y,,} by the rule 


= 1 
ar a 


Now {yn} is unbounded above, and so Lemma 2.13 guarantees that there is an 
increasing subsequence {yp } with yn, — co. But this implies that 


1 
Unk 


=P -—&n, — 0. 


This implies, in turn, that {ap } is increasing, with limit 6. A similar argument 
applies ifa ¢ {rp}. 


At last we can prove Proposition 2.12. Watch for the completeness axiom to 
pop up, and for Lemma 2.14 to provide the key technical insight. 


Proof (of Proposition 2.12): Let {an} be our sequence. If {x,,} is unbounded, 
we’re done, by Lemma 2.13. So we’ll assume that 


(i) {a} is bounded, and (ii) {x,,} has no increasing subsequence, 


and use these assumptions to construct a decreasing subsequence. 
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To get started, let 3, = sup{x1, 72,x73,...}. (Assumption (i) and the com- 
pleteness axiom guarantee that (3, exists.) The key observation is that (3; is in the 
set {@1, %2,%3,...}. Otherwise, says Lemma 2.14, some increasing subsequence 
would converge to 3,. Thus, we can find n1 with x,, = (31; this is the first term 
of our desired subsequence. 

To continue, we set 


Bo = sup {€n, 41; Un +2,Tn143,-++- ie 


Again, (2 exists by the completeness axiom, and Lemma 2.14, applied this time 
to the upper tail sequence 


Ln 41, Un,4+2; Un14+3, Uni +4---, 


implies that G2 = xp, for some nz > ny}. It is clear, too, that G2 < (1, as Bo 
bounds a smaller set than does (3;. 

Continuing this process indefinitely produces the desired decreasing subse- 
quence {%n,, Un .,Lng,--- }, and the proof is complete. 


A big theorem. It is now easy to prove a famous theorem, which we’ll use re- 
peatedly. We’ve done all the hard work. Starting in the very next section. 


Theorem 2.15 (Bolzano—Weierstrass). Every bounded sequence {X»,} has .acon- 
vergent subsequence. 


Proof: By Proposition 2.12, {x,,} has a monotone subsequence {2,,, }, which is 
also bounded because {x,,} is. By Theorem 2.3, page 78, every monotone and 
bounded sequence converges. 


Exercises 


1. The Bolzano—Weierstrass theorem (BWT) says that if a sequence {,,} is 
bounded, then {2,,} has a convergent subsequence. 


(a) State the converse and the contrapositive of the BWT. Disprove the 
false one. 
(b) Consider the sequences {y,} and {2} defined by yn = sinn and 


Zn, = ==". What can be said about their subsequences? Is the BWT 


helpful? Is it needed? 

(c) Show (assuming the BWT) that every sequence {x,,} has either (i) a 
convergent subsequence, or (ii) a subsequence that diverges to +oo. 
Can both (i) and (ii) occur? 
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2. Give an example in each part as indicated; no proofs needed. 


(a) A sequence with subsequences converging to 1, 2, and 3. 


(b) A convergent sequence with a subsequence of positive terms and an- 
other subsequence with negative terms. 


(c) A sequence with subsequences converging to 0 and to oo. 


(d) A positive sequence with monotone subsequences converging to 0 and 
to oo. 


(e) A sequence of positive integers with subsequences that converges to 
every positive integer. 


3. Because Q is countably infinite, we know that it can be written as a se- 
quence. 


(a) Explain why the set of nonnegative rationals can also be written as a 
sequence, say, {py}. 
(b) Give a reason why {p,,} cannot be monotone. 


(c) Although {p,,} itself is not monotone, it must have a monotone sub- 
sequence of positive integers. Explain why. 


4. Let {x,, } be any sequence of numbers and its upper tail subsequence {zp,, } 
given by 24242, 4243, ©4244, .--. Show that x, — L if and only if 


Ln, > L. 


nin 


5. Consider the sequence {2,,} defined by x, = (—1)">75. 


(a) Find the subsequences {x2,} and {2,1}. What are their limits? 
(No proofs needed.) What does the answer imply about convergence 
of {x,,}? 


(b) Let {ap,} be any subsequence of {x,,}. Show that if {x,,} con- 
verges, then it must converge either to 1 or to —1. 


6. Let {x} be a sequence that does not converge to three. 


(a) Show by an example that z,, = 3 can hold for infinitely many n. 


(b) Show that there exists some € > 0 and a subsequence {x,,, } such that 
|2n, — 3| > € forall k. 


7. Show that if {x,,} diverges and xo is any number, then there exists « > 0 
and a subsequence {x,, } such that |x, —%o| > € for all k. (In other 
words, some subsequence {2,,, } is “bounded away from xo.) 


2.4. Cauchy Sequences 


8. This problem is about Theorem 2.11, page 95. 


(a) Prove Theorem 2.11(b). 


(b) State the contrapositive of Theorem 2.11(a). How is this related to 
Theorem 2.11(c)? 


9. Let {x} be a sequence and x a number. Show that x, — 2o if and only 
if pn, — Xo for every monotone subsequence {Xp, }. 


10. Suppose the sequence {2,, } converges to 79. Show that there exists a sub- 
sequence {2p } with |a,, — 20| < 1/10* for all k. (In other words, some 
subsequence converges rapidly to x9.) 


11. Let {x} be a sequence and x a number. 


(a) Show that x, — 2p if and only if for every « > 0 the set {n | x, ¢ 
(ap — €, 20 + €)} is finite. 

(b) Show that some subsequence {x } converges to Xo if and only if for 
every € > 0 the set {n | tp € (ao — €, 2 + €)} is infinite. 


2.4 Cauchy Sequences 


The obvious way to prove that a sequence {2,, } converges is to know or guess a 
limit, say L, and then show that x, ~ L for large n (in the precise sense of the 
definition). For some sequences no obvious candidate for L presents itself, but 
we may know instead that the x,, are close to each other for large n. Consider, 
for instance, the sequence {s,,}: 


1 be tT 1 ty 1 oil 
3° 3. OB BA: Od ee os 
Here are approximate numerical values for the first few s,,: 


1.00, 0.667, 0.867, 0.724, 0.835, 0.744, 0.821, 0.754, 0.813, .... 


Successive s,, appear to jump back and forth across smaller and smaller intervals. 
Does {s,,} approach some limit? 

In this particular case the answer is classical. Our sequence {s,,} is derived 
from the famous Leibniz series 


1 1 1 1 1 1 1 
a5 719 nu’ B ae: 
which has been known for at least 300 years to converge to 7/4 ~ 0.7854. More 
generally, {s,,} illustrates the notion of a Cauchy sequence, named for the French 
mathematician Augustin—Louis Cauchy (1789-1857), a key figure in the devel- 
opment of real analysis. 
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Note where we use n> m. 
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Cauchy Basics 


The formal definition describes precisely what we mean by “close to each other 
for large n.” 


Definition 2.16. A sequence {x,,} is Cauchy if, for every « > 0, there exists N 
such that 
|2n —XLm|<e€ whenevern >m>N. 


Observe: 


° Similar, but different: The definition resembles that for convergence, but 
with a key difference: no limit L is mentioned. 


° Not just consecutive terms: The definition requires that all terms with large 
index be close to each other. This applies not only to consecutive terms, like 
12345 and X12346, but also to any two terms with large index, like 712345 
and LO8765- 


¢ A minor convenience: The condition n > m in the definition is sometimes 
convenient in proofs. This apparent asymmetry is actually harmless, be- 
cause the case n = m is trivial, and we lose no generality in using n to 
denote the larger of two indices. 


EXAMPLE 1. As we know, the sequence {1/n} converges to zero, while {./n} 
diverges to infinity. Is either sequence Cauchy? 


SOLUTION. The sequence {1/n} is Cauchy. Intuitively, we expect that, since 
terms approach zero, they must also stay close to each other. To prove this for- 
mally, let € > 0 be given, and set N = 1/e. This N “works,” since ifn > m > N, 
then 


as the definition requires. 

The sequence {./n} is not Cauchy—even though the difference between suc- 
cessive terms (like 123456 = 351.363 and 123457 = 351.364) tends to zero. 
We can prove this formally by showing that if « = 1, then no N works in the sense 
of the definition. Indeed, let N be any positive number, and let k be any positive 
integer with k? > N. If we set m = k? and n = (k + 1)?, then 


n>m>N but /n—-V/m=1=€, 


and so the definition is not satisfied. ©) 


2.4. Cauchy Sequences 


EXAMPLE 2. The sequence {s,,} at the beginning of this section is Cauchy. 
Explain why. 


SOLUTION. The key idea, on which a formal proof can be based, is the se- 
quence’s back-and-forth behavior: 


$2 < 84 < 86 < 8g +++ $7 << $5 << 53 < $1, 


combined with the fact that the distance between successive terms tends to zero. 
If, say, € = 0.001, we could set N = 1000, and observe that if n > m > 1000, 
then 


81000 < Sn, $m S $1001 = $1000 + 1001 


which implies that s,, and s,,, lie within 1/1001 of each other. © 


Morals from the examples. The examples suggest, correctly, a close connec- 
tion between sequence convergence and the Cauchy property. Indeed, the two 
properties turn out to be equivalent. As we’ll see, it is easy to show that every 
convergent sequence is also Cauchy. The fact that every Cauchy sequence con- 
verges is deeper, and proving it takes a little more work. The Bolzano—Weierstrass 
theorem—another convergence guarantee—will be useful. 


Properties of Cauchy Sequences 
First we pick two low-hanging fruits. 
Proposition 2.17. If {x,} converges, then {x,,} is Cauchy. 


The basic idea is that if both x, and x,,, are eventually close to a limit L, then 
they must also be close to each other. The proof makes this precise. 


Proof: Suppose 2», > L. For given e > 0, we can choose N such that |x, —L| < 
€/2 whenever n > N. This N works in the Cauchy sequence definition, because 
ifn >m > N, then 

€ 


€ 
tn — tm] Stn —L]+]am—-Lh<5+5 


HS 6 


we used the triangle inequality in the first step. 


Proposition 2.18. If {x,,} is Cauchy, then {x} is bounded. 


Proof: For € = 1, choose N as in the Cauchy definition. Now fix any integer 
mo with mp > N. If n > mo, then we have |x, — 2m,| < 1, or, equivalently, 
Lmy —1 < Ln < Um, +1. In particular, the set {2m @mo+1;%mo+2---} is 
bounded (above by x, + 1 and below by z,,, — 1). Since the remaining terms 
{x1,£2,...L%m,—1} form a finite set, the entire sequence {z,,} is bounded. 
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Why is it OK to use €/2, not €? 
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Watch for another «/2 trick. 


Infinitely many possible xj 
exist; any one will do. 


Some decimals are rounded. 


2. Sequences and Series 


Combining Proposition 2.18 with the Bolzano—Weierstrass theorem lets us prove 
our main theoretical result. 


Theorem 2.19. A sequence {x,,} converges if and only if {a} is Cauchy. 


Proof: Proposition 2.17 is the “only if” part. To prove the “if” part, suppose that 
{xn} is Cauchy. Proposition 2.18 implies that {x,,} is bounded; by the Bolzano— 
Weierstrass theorem, {x,, } has a subsequence {x,,, } that converges to some limit, 
say, L. To complete the proof, we’ll show that the full sequence {,,} also con- 
verges to L. 

Let « > 0 be given. Since {z,,} is Cauchy, we can choose N; such that 


|Un —Lm| <€/2 whenever n>m> Nj. 


Because {2p } converges to L, we can choose a member of the subsequence, say 
xn, with N > N, and |xy — L| < €/2. Now this N works in the definition of 
convergence of {2} to L, because if n > N, then 


€ 


€ 
Jan — L| < |tn — tn] + \an —L] <5 + =€6, 


2 
as desired. 
EXAMPLE 3. A “random walk” sequence a1, a2, a3,... is produced iteratively, 
with help from a fair coin. First we flip the coin and set a; = 1 if heads show 
and a; = —1 otherwise. Then we flip again and set az = a, + 1/2 for heads and 


ag = a, — 1/2 for tails. We flip again at each stage, and set 


Qn — + if tails. 


Gn + sy if heads, 
antl = 
Hy 2 


Particular values of a, depend, of course, on random coin flips. Here’s what 
happened on one try: 


=a, 0G.  VslOb, S25). SL 1s75s -a1a5es: 
—1.1406, —1.1484, —1.1445, -1.1426, 1.1436, 


Must every such sequence converge? 


SOLUTION. Yes, thanks to Theorem 2.19. The sequence {a,,} is Cauchy—no 
matter how the coin falls. To prove this formally observe first that, for any a, and 
Am with n > m, we have 


2.4. Cauchy Sequences 


1 1 1 
tn — aml = | gm ~ mri Qn-1 
1 i 1 
Som t omer" Te ee 
a oe 1 
~ gmai (G++ a =) 
ea 
Qma1" 


Now for given « > 0, we can choose N with 1/2"~1! < e. This N works in the 
Cauchy definition, because ifn > m > N, then, as just shown, 


lan — Gm| < <eé, 


Qm-1 = Q9N-1 
as desired. ©) 


Cauchy, convergence, completeness: crucial C-words. This section’s main re- 
sult, that Cauchy sequences converge, is not obvious; proving it took planning 
and effort. The work is worthwhile because it guarantees that sequences like 
those of Example 3, for which the terms become close together in a strong sense, 
do indeed have specific numerical limits. Without such guarantees it would be 
difficult, for example, to define the integral of a function (another kind of limit) 
unambiguously. 

Another C-word—completeness—is at the heart of Theorem 2.19. Complete- 
ness appears implicitly in the proof, by way of the Bolzano—Weierstrass theorem, 
which itself required the completeness axiom. (We invoked it in proving Proposi- 
tion 2.12, page 95, for example.) Indeed, completeness is sometimes defined not 
in terms of the supremum (as was done earlier; see page 65) but rather in terms of 
Cauchy sequence convergence. We pursue this connection briefly in the exercises. 


Exercises 


1. Show that each of the following sequences either is or is not Cauchy. (Use 
the definition, not theorems.) 
(<)" 
= fF 
(b) {Yn}, where yp, = See 
(c) {zn}, where z, = 


(a) {x,,}, where x, = 


a 
n+1° 
(d) {w,}, where w, = wat: 


2. Let {x} and {y,} be Cauchy sequences and C a constant. Use Defini- 
tion 2.16 to show that {x + Yn} is Cauchy, too. 


As we'll do later. 
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2. Sequences and Series 


. Let {x,,} be a Cauchy sequence such that 21 = 0 = X19 = X100 = X1000 


.... Use Definition 2.16 (not theorems) to show that {x,,} converges to 
zero. 


. Let {x} be a Cauchy sequence, with x, € Z for all n. Show that {x,,} 


is “eventually constant’”—i.e., there exists N > 0 such that r, = rm, 
whenever n > m> N. 


. As we Said at the end of this section, completeness can be defined using 


Cauchy sequences: a subset A C R is complete if every Cauchy sequence 
{x,,} contained in A has a limit in A. Use this definition below. 


(a) The set Q of rational numbers is not complete. Show this by finding a 
Cauchy sequence {r,,} of rational numbers that has no rational limit. 


(b) Is the set Z of integers complete? 
(c) Show that [0, 1] is complete but (0, 1] is not. 


. From any given sequence {x,,} we can form the related sequence {y,,} = 


{5a + 2}. 
(a) Use the definition of a Cauchy sequence (not theorems) to show that 
if {x,, } is Cauchy, then so is {yp}. 


(b) Use any convenient theorems to give a shorter (not necessarily better) 
proof that if {x,,} is Cauchy, then so is {Yn}. 


. Let {a,,} be a Cauchy sequence. Show that if {2,,} is a sequence such that 


|tn — Lm] < 42 lan — am| for all n and m, then {2,,} is Cauchy, too. 


. This problem is about the sequence {a,,} in Example 3, page 102. 


(a) Under what circumstances is {a,,} monotone? 


(b) Suppose the first six coin tosses show HT HT HT. Find the smallest 
interval in which the limit can lie. 


(c) Explain why —2 < a, < 2 for all n (regardless of the coin tosses). 


. Consider the {7,,} defined by 7, = 3.1, m2 = 3.14, 73 = 3.141, ...; for 


each n, 7, shows the first n decimal digits of 7. Show (use the definition, 
not theorems) that {7,, } is Cauchy. 


. This problem is about the fact that if {2 } and {y,,} are Cauchy sequences, 


then {Yn} is Cauchy, too. 


(a) Use Theorem 2.19, page 102, and any other already-proved results to 
explain why the fact holds. 


2.5. Series 101: Basic Ideas 


(b) (Harder.) Prove the same fact without using Theorem 2.19, page 102. 
(Hints: An algebraic trick and the triangle inequality give 


< lCnYn _ LnYm| a |2nYm = Lintra| 


Now use boundedness of both {x,,} and {y,} (Proposition 2.18) to 
complete the proof.) 


11. Let {an} be a sequence for which successive terms are “very close”: 
|a@n41 — Gn| < 1/2”*? for all n. Show that {a,,} is Cauchy. (Hint: For 
any n and m with n > m, we have 


lan — Am| < l\@m+1 — Am| + |@m+2 — Am41) +--+ + lan — Gn—1| 
1 1 1 
Som ome 


How large can the last expression be?) 


2.5 Series 101: Basic Ideas 


An infinite series is a sum with infinitely many summands, or terms: 


co 
So an = a1 +42 +434... 
k=1 


Here are two important examples, each of which we will revisit: 


As usual for infinite processes, some obvious questions arise. Is there some limit 
(or “infinite sum’’) to which the series converges? How do we decide? If a series 
has a limit, how do we find it? 

A first observation is that series are close kin to sequences—about which we 
already know a lot. Indeed, every series )> ax corresponds in a natural way to 


a certain sequence {A,,} whose properties tell us “everything” about the series. We'll define it in a moment. 


The flaw in this happy scenario is that, in practice, getting one’s hands directly on 
the sequence {A,,} can be difficult. Much of what follows can be thought of as 
strategy for getting around this problem. 

Enough generalities. 
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Much more about geometric 
series soon. 


More soon about this, too. 


Thanks, technology; values are 
rounded. 


By Theorem 2.3 page 78 


2. Sequences and Series 


Definition 2.20. For a given series ae dk = 4, +a2+a3+..., the sequence 
of partial sums is given by 


Ay=a,, Ag=art+ag, ..., An =G1 +ag+---+4n, 


In general, A, = Sy, Gk. 


EXAMPLE 1. Describe the partial sum sequences for the two series illustrated 
above. Do they have limits? 


SOLUTION. The series 
oS a Wy 4 
doa = Doge =ltgtgte. 
k=1 k=1 


comes from the geometric family, in which successive terms differ by a constant 
multiple. Here we have 


1 3 1 1 7 1 15 
A, =1, Ag Its 3 Age dang + Gas Aqg= Ast oa oe 


Now it is easy to guess—and to prove by induction—that 


2” —1 1 
An = Qn-1 =2 Qn-1 
Clearly, A, — 2, so it makes sense to write 1 + 4 + + + + ++++ = 2. We'll 


sanctify this below with a formal definition. 
The series 


is the famous harmonic series. Here are the first few partial sums H,,, also known 
as harmonic numbers: 


1 3 dae 
A, = 1, Bai LG 5) eS Soh hg tae H4= H3+7-=—. 


No pattern seems obvious, so let’s try some numerical experiments: 
9 = 2.929, Haz3 = 6.649, H7ggi = 9.549, Hi2345 = 9.998. 


The #7, are clearly increasing, but slowly. And mysteries remain: is the sequence 
{H,,} bounded, and therefore convergent, or do the harmonic numbers diverge to 
infinity? 


2.5. Series 101: Basic Ideas 


It turns out—as you probably know, and as we will review in exercises—that 
{H,,} diverges to infinity. This has been known for at least 650 years, thanks to 
the French philosopher and bishop Nicole Oresme. The moral for now is that this 
partial sum sequence has no simple formula, and so requires some work-around 
effort to analyze. This state of affairs is typical—most series don’t have partial 
sum sequences with nice, “closed form” expressions. Exceptions, like the first 
series above, are precious. ©) 


Series: The Basics 


Our systematic discussion starts, as usual, with a formal definition. 


Definition 2.21. Let Si az be a series of numbers, and {A,,} its sequence 
of partial sums. If {A,,} converges to a finite limit A, then we say )77-, ax 
converges to the sum A, and write )*7°_, a, = A. Otherwise, >7°_, ax diverges. 


Notes on notation. Sigma notation packs a lot of information into a small suit- 
case—and so may need careful unpacking. For instance, all of the following 


107 


expressions mean exactly the same thing: Convince yourself by writing out 
some terms. 
foe) foe) 103 loo) foe) love) 
1 1 1 1 1 1 
Pee 7 ar ve) eres . 
yo ee ee 
k=1 j=) j=l j=104 j=0 z=13 


With due care taken, such cosmetic differences won’t cause trouble. For typo- 
graphical economy, moreover, we will write just )~ a, not 4 ay, When con- 
fusion seems unlikely. 

As another example of notational alternatives, note that the equation 


foe) n 

) ay = lim ) ar=A 
n— co 

k=1 k=1 


is a highly compressed form of the definition of convergence of a series }> ax, to 
the sum A. There is no shame—and often added clarity—in expanding things a 
bit, so we might write, instead, Efficiency isn’t everything. 


a, tag+tagt¢e:-= lim (a1 + ag + ag +++: +n) =A. 


n—oo 


EXAMPLE 2. Does the series )>>-, 1/(k? +k) = 3 +44... converge? If so, 
to what limit? 
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Try some by hand. 


2. Sequences and Series 


SOLUTION. By ahappy coincidence, the partial sums follow a nice pattern: 


1 2 3 100 

51 = 5) 52 = 3 53= 7) a) 5100 = Tp 
Now it is easy to guess a formula for S,,, and to see that S, — 1. (See the 
exercises for details.) ©) 


Nonnegative terms, increasing sums. Getting one’s hands on directly on partial 
sums can be difficult, as we’ve said. Things are simpler for a nonnegative series 
So az: one for which a, > 0 for all k. In this case the partial sum sequence 
{Aj, Ao, A3,...} is increasing: 


a, S a, + ag S a, +a2+ a3 5 a, +ag+ag3+a4<.... 


By Theorem 2.3, page 78, there are only two possibilities: Either {.A,, } converges, 
or it diverges to infinity. If we can somehow rule out either alternative, the other 
must apply. 


Algebra with convergent series. Convergent series, like convergent sequences, 
allow basic algebraic operations; the following theorem has a counterpart for se- 
quences in Theorem 2.5, page 83. Proofs of the following theorem, too, follow 
easily from those for sequences. 


Theorem 2.22 (Algebra with convergent series). Let )> a, and S> by, be con- 
vergent sequences, with respective sums A and B. 


© Sums and differences: 57> (ay, + bn) converges to A + B; equivalently, 


So (an +b,) = Sian + ye be 


* Constant multiples: > cay, converges to cA; equivalently, 


) ct eS An- 


Proof: We will leave the proof for sums as an exercise. For constant multiples 
we want to show, in effect, that the distributive law applies to (convergent) infinite 
sums. To do this, we need to show that the sequence of partial sums 


Cay, CA, + Cag, CA, + CaQ2 + Caz,... 


converges to cA. This follows from the distributive law for finite sums, and from 
what we already know about sequences. By the distributive law, we can rewrite 
the preceding sequence as 


cay, C(a1 + a2), clay + a2 + a3), ..., Clay tag+...dn),..., 
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or, equivalently, as 
cA, CA2, CA3,..., CAn, ---, 


where { A,,} is the sequence of partial sums of 5+ a,. But we know by hypothesis 
that A,, — A, and the constant multiple rule for sequences implies that cA, — 
cA, as desired. 


EXAMPLE 38. Does the series 


e eae ide bea 
Qe-l kf} De De A Be 
k=1 
converge or diverge? Why? 
SOLUTION. It diverges. The given series is the sum of two simpler series— 
one convergent and one divergent—discussed in Example 1. In fact, every such 


sum diverges. To see why, suppose that 5+ a, converges and )~ b, diverges. If 
So (Gn + by) were convergent, then, by Theorem 2.22, the series 


would also converge, against our assumption. © 


Detecting Convergence and Divergence 


Theorem 2.22 is nice to know. But it is of little use unless we know that some 
particular series converge or diverge. To this end, we need reliable methods for 
deciding whether a given series converges or diverges. We address that deficit 
next, starting with a simple test for divergence. 


Proposition 2.23 (nth term test). If a series )>>°_, Gn converges, then the se- 
quence {a,,} converges to zero. 


In practice, the contrapositive is especially useful: 
if {a} does not converge to zero, then > ay, diverges. 
The converse, 
if a, — 0 then >* a, converges, 


Don’t give in. (The harmonic 
is tempting but false: a series }> a, may diverge even if a, — 0. series is one counterexample.) 
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Watch the difference 
“telescope.” 


It’s readily proved by induction. 


2. Sequences and Series 


Proof: Observe first that, for all n > 1, 
An — An—1 = (ay ++++ + Gn—1 + Gn) — (€1 + +++ + Gn_1) = an- 


Now convergence of 5 a,, means that A, — A for some number A. But then 
A,—1 — A, too; therefore, 


lim a, = lim (A, — An—1) = lim A, — lim Ap-1 


n—oo n—- co n—- co n—- oo 


=A-A=0, 


as claimed. 


Geometric series: the nicest examples. Proposition 2.23 sometimes helps us 
identify divergent series. But we would still like a good source of convergent 
series; the geometric family fits that bill. 

A geometric series is one of the form 


Co 
y ar®—! =atar+ar?t+ar?+..., 
k=1 


where a and r are any fixed numbers; notice the common ratio r of each summand 
to its predecessor. The first series discussed above, for instance, is geometric, with 
ratio r = 1/2: 


Here is another example: 
3 + 3.3 + 3.63 + 3.993 + 4.3923 + 4.83153 + 5.3146834.... 


The geometric form may be slightly hidden in the numbers, but it is easy to check 
that successive terms have common ratio r = 1.1, and so the standard geometric 
template )>7°., 3 x 1.1*~? fits the pattern. 
Geometric series satisfy a useful algebraic equation: 
4 1—r” 


atar+ar? +ar?+-+»+ar" iG (*) 
—r 


holds if r ~ 1, n is a positive integer, and a is any number. This pretty formula 
tells us, almost instantly, whether any geometric series converges and, if so, to 
what limit. 


2.5. Series 101: Basic Ideas 


Proposition 2.24. Let a and r be any real constants, with a £ 0. 


(a) If |r| <1, then >>, ar*~! converges to 4 


Tr- 
(b) If \r| > 1, then °°, ar*—" diverges. 
We can infer, for instance, that )7?°_, 0.5*—! converges to 2—as we proved in 


Example 1. By contrast, Sra 3 x 1.1*~! diverges—as the numerical evidence 
suggests, and as the nth term test also implies. 


Proof (sketch): Convergence and divergence of series are all about partial sums, 
and Equation («) above tells almost the whole story. If r 4 +1 we have 


nm 
ae 1—r” 
) ar = a——. 
l-r 

k=1 


What happens to the right side as n — oo? The answer depends entirely on the 
r” in the numerator—and that depends, in turn, on the value of r. If |r| < 1, then 
r” — 0, and so 


Lar? 1 

—a ; 
l-r l-r 

as claimed. If |r| > 1, then {ar”} is unbounded, and so the series (badly!) fails 

the n'" term test, and so diverges. 


The remaining cases (r = +1) are much simpler, and left to the exercises. 


a 


EXAMPLE 4. Proposition 2.24 and Theorem 2.22 justify calculations like the 
following: 


fore) 3k 4 4k foe) 3k lore) 4k love) 3 j-1 foe) 4 j-1 
ap Pe ees) FL) 
k=0 k=0 k=0 j=l j=l 
1 1 15 
—_— + Se CO 
[246 1S4s 2 
Similarly, 
sali a OU aaa jee 
Wee Negi Gee go = a bp 2'°4 8 
k 


II 

SS) 

i 

T 

o 

i 

T 

Pe wlRe wle 

Ms 
a. 
Nile 


1 2 
=m7+e+ = ~ = 6.082. 
POP a Ted. a eG 
Numerical experiments support these results. According to Mathematica, 
50 9k k 50 k-1 
3° +4 1 1 
= 7.4993; a a a = 6.0821. % 
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Watch the tricky index change 
from k to j. 
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Comparison and comparison testing. Compare these two series: 


ee re 
: Dee De Be Be ge Te 


The first series we recognize—it is geometric and converges to two—but what 
about the second? Does it also converge, or might its partial sums diverge to 
infinity? 

Comparing the two series suggests some common-sense guesses. Because 
each term of the second series is less than its counterpart in the first, we expect 
the second series, like the first, to converge, and to some limit less than two. 
Sampling some partial sums {.S,,} from this series adds some credence: 


Comparing series for size in this simple way is valid—and useful. Details are in 
the following theorem and proof. 


Theorem 2.25 (Comparison test). Let > ay, and )> by be series, withO < ax < 
by for all k. 


(a) If >> by converges to a number B, then \~ ax, converges to a number A, 
with A < B. 


(b) If ¥> ay diverges, then )~ by, diverges, too. 


Proof: Let {Ay} and {B,,} be the partial sum sequences for 5* a, and S> bx. 
Because {A,,} and {B,,} are increasing sequences, each converges if—but only 
if—it is bounded above. By hypothesis, 


An = 41 +42 + a3 +++++ Gn < bi + b2 + 3 ++++ + dn = Bn 


for all n. If {B,,} happens to be bounded above, then certainly {A,,} is also 
bounded above, and we have 


A=sup{An} < sup{B,} = B. 


In this case A,, — A and B, — B, as (a) claims. Part (b) follows from part (a). 


EXAMPLE 5. Does each of the following series converge or diverge? What can 
be said about limits? 


2 — 1 | sink 
OD ee (OD a OD 
k=1 k=1 


2.5. Series 101: Basic Ideas 


SOLUTION. Series (a) converges by comparison to a convergent geometric 
series: 


2k 1 Qk 
3h SBR 
holds for all k > 1, and so 
een Mn 
Dae Ss ge 
k=1 k=1 


We can evaluate (a) exactly if we recognize it as the difference of two convergent 
geometric series: 


OU Ea, eee. etd it 
Dorges age Doge 
k=1 k=1 k=1 


pe ere en 
CAL PB ee 


The harmonic series ++ 4 + $ +... comes to mind, but term-by-term comparison 
goes the wrong way. A better strategy is to multiply the harmonic series by 1/2: 


The resulting series diverges (why?), and is comparable to series (b), which there- 
fore also diverges. 

For series (c), comparison with prea 1/ QF is tempting, but some terms of (c) 
may be negative, and so Theorem 2.25 doesn’t apply. Still, numerical evidence 
suggests convergence; here are some (approximate) partial sums S,,: 


85 © 0.588433, $19 © 0.59333, sis © 0.59285, so9 © 0.59284. 


The case for convergence looks strong, but we don’t (quite yet) have a proof. 


Absolute Convergence 
Series (c) from Example 5 stumped us. The next theorem comes just in time. 


Theorem 2.26. If >> |ax| converges then S~ ax, converges too, and 


CO CO 
San] SS lanl 
k=1 k=1 
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It’s a triangle inequality, but for 
infinitely many summands. 
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We don’t know exactly where. 


We worked hard to prove the 
theorem; here comes the 
payoff. 


Watch for the (finite) triangle 
inequality. 


2. Sequences and Series 


Let’s polish off series (c) from Example 5 before proving this. Clearly, 


|sin k| 1 
ae SR 


for all k, and so Theorem 2.25 guarantees that the left-hand series converges, 
to some positive limit less than one. Theorem 2.26, in turn, says that series (c) 
converges too, to a limit somewhere in the interval (—1, 1). 


The proof, informally. Suppose 5 |a,,| converges. To show that )~ a, converges, 
too, we need to prove that the partial sum sequence {A,,} converges. Since we 
have no specific limit in mind, we will show that {A,,} is a Cauchy sequence, and 
then invoke Theorem 2.19, page 102. 

Observe first that, because 5~ |a,,| converges, its partial sum sequence 


Ci = lai|, Cy = lay| + |ao|, C3 = lay| + |a2| + las|, pene 


is convergent, and therefore Cauchy. We will use this fact and the triangle in- 
equality to show that {.A,,} is Cauchy, too. 

Let « > 0 be given. Because {C’,} is Cauchy we can choose N such that 
|Crn — Cm| < € whenever n > m > N. The same N turns out to work for { A, }, 
because if n > m > N, then 


|An — Am| = \(@1 + -++ + @m + Qm41 +++: + an) — (€1 +... Am)| 
= |dmi1 tamp2 +++: + an| 
< |am+i| + |a@m+2| +--+ + |an| 
= Ch — Cm = |Cn — Cm| <€. 


Thus, { A,,} is Cauchy and hence also convergent. 
Invoking the triangle inequality again gives 


|An| = a1 +@2 +--+ +an| < lar] + ae] +--+ + lan] =Cr 


for all n, and so 


> ax = lim |A,| < lim C, = oy |ax| - 
This finishes the proof. 


Conditional convergence. A series 5° a, for which 5° |a;| converges is called 
absolutely convergent. Theorem 2.26 says that every absolutely convergent series 
is also convergent in the ordinary sense. The converse, however, is false: A series 
>> ay may converge while )~|a;,| diverges. A series with this property is called 
conditionally convergent. 


2.5. Series 101: Basic Ideas 


EXAMPLE 6. The alternating harmonic series 


- 1 Lee Pale at 
SS ign ae he ge 
pO ae: a ae a 


ce 
Il 
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is conditionally convergent. Explain why. Can anything be said about limits? 


SOLUTION. Taking absolute values of all terms gives the ordinary harmonic 
series 


op eee 
a ae ee ee 


which we know to diverge. To see why the alternating version converges, consider 
the partial sums {S', }: 


1 1 1 1 1 1 ds fd od 

1ll—-=,1-=+-+,1-=+-+---,1-=+-=--4+2-.. 

, 2’ 273 ae 4’ 273 riga 
Here’s a (rounded) numerical view of S1,..., S10: 


1.00, 0.500, 0.833, 0.583, 0.783, 0.617, 0.760, 0.635, 0.746, 0.646,.... 


Notice the pattern: Successive partial sums alternately increase and decrease, but 
with smaller and smaller gaps. In particular, for any N, every term S,, withn > N 
lies between S'y and Sy +1. If n > 1000, for example, then 


0.69265 = S'1000 < Sy, < S'i001 ~ 0.69365. 


The same holds for two terms. If n > m > 1000, then both S, and S,,, lie 
between 0.69265 and 0.69365, and so lie within 0.001 of each other. Using this 
idea we can prove that {S,,} is Cauchy, and hence convergent. We leave details 
to the exercises. 

So what is the limit? The striking but far-from-obvious exact answer turns 
out to be In2 = 0.69315. A rigorous proof is beyond our scope, but numerical 
evidence shows we are in the ballpark. ©) 


Exercises 


1. For each series following, find a formula or simple rule for the partial sums 
S,. Then decide whether the series converges or diverges, and to what 
limit. 

(a) 372.,0=0+0+0+... 
(b) 072, 42 = 42 +42+42+... 
(c) DR (-1)*¥ =-14+1-1+... 


The signs alternate. 
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(d) Oey k=1t243+.~... 
() 3p 0.99" 


(f) S02, L/(k? + k) (Hint: Guess a formula for S,,; prove it by 
induction.) 


2. This problem is about partial sums H,, of the harmonic series (see Exam- 
ple 1, page 106). 


(a) Oresme’s (fourteenth-century) proof uses inequalities like the follow- 
ing: 


NIR 


1 1 1 
a he BG eo oe 


Deduce that Hp» > 3, and so H, — ov. 


> 


(b) Consider the inequality 
Ay eee (ee Oe aie ea 
a 2 a A 5 6 In—1' In 
ns 1 1 
>S>aetet+a4¢...—-==4+Hp. 
~ 2 = 2 = 3 ca n 2 as 
Use this to show (again) that {H,,} diverges. (Hint: If {H,,} con- 
verges, then the subsequence { H2,, } converges to the same limit.) 
(c) Yet another proof that H,, converges follows from the inequality H,, > 


In n for all positive integers n. Lacking a formal definition of Inn, we 
can’t prove this rigorously. Instead, draw a picture illustrating the in- 
equality; consider Inn to be the area under the graph y = 1/x from 
z=ltor=n. 


3. Decide whether each of the following series converges or diverges. The 
comparison test may be useful in some parts. 


Be 1 

(@) ies k+ 2h 
= k 

(b) ie 2k? — 1 


co eae e: 
(c) \o-1 s¢.~—- Hint: Compare to a geometric series.) 


©) Vem 3a 


4. A telescoping series is one for which the partial sums S,, “collapse” to 
some shorter, simpler form. 


2.5. Series 101: Basic Ideas 


(a) Forthe series }>7- (¢ _ rat)» write out (by hand) S2 and S. Do you 
see the “telescoping” behavior? Find a formula for S,, and deduce the 
limit. (Note: This series, rewritten as )>7°., 1/(k? + k), appears in 
an earlier problem.) 


(b) For the series 7°24 (¢ — ers)» find a formula for S,, and then find 


the limit. (Hints: Write out, say, S19 to see the pattern for S',.) 
(c) For the series )P° 4, (¢ — win)» find a formula for S,, and then find 


the limit. (Hints: Use ideas from the preceding parts. No proofs 
needed, but check your claim numerically, using technology.) 


5. Prove the claim about sums and differences in Theorem 2.22, page 108. 
(Note that the claim amounts to saying that addition is commutative for 
convergent infinite sums; we already know that addition is commutative 
for finite sums.) 


6. This problem ties up some loose ends concerning Proposition 2.24, page 111, 
which addresses convergence and divergence of oS i aos 


(a) How does the series look if r = 1? What if r = —1? Prove (using 
partial sums) that the series diverges in both of these cases. 


(b) In the proof of Proposition 2.24 we used several reasonable-seeming 
facts about convergence and divergence of the sequence {r”}, de- 
pending on the value of r. Prove them, as follows: 


(i) If r and L are any real numbers and r” — L, then r?+! > L, 

too. 

(ii) If r and L are any real numbers and r” — L, then r™+! = 
rr’ orl, 

(iii) If A Oandr” — L, then L = 0. (Hint: Use the two preceding 
parts.) 

(iv) If 0 < r < 1, then r” — O. (Hint: Use a theorem about 
monotone sequences.) 

(v) If-l<r<0, then r” — 0. (Hint: Squeeze.) 

(vi) If |r| > 1, then {r”} diverges. (Hint: It is enough to explain 
why r” — 0 is impossible.) 


7. This problem is about the alternating harmonic series. See Example 6, 
page 115, where we said that the partial sum sequence {5S,,} is Cauchy. 


(a) Claim: For any positive integer N, every partial sum S,, with n > N 
is between (or equal to one of) Sy and Sy41. Prove this for N = 
1000. (A general proof is similar.) 
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(b) Show that, for any « > 0, we can find N such that |Siv — Sy4i| < €. 
(c) Use the preceding parts to show that {.5;, } is Cauchy. 


8. This problem explores implications of Theorem 2.22, page 108. Through- 
out, let 5> c;, be a convergent series and let )~ dj, be a divergent series. 


(a) Use Theorem 2.22 to show that > d;,/10 diverges. 


(b) Can 5+ c,d, converge? Can it diverge? Give examples or proofs as 
needed. 


(c) Let A be a constant. Can }> Ac; diverge? Give a proof or counterex- 
ample. 


(d) Is it possible that d;, < cy, for all k? If so, give an example. If not, 
why not? 


9. Let 07°, ax and S>7°°_, by be two series, and suppose that a;, = b;, when- 
ever k > 42. 


(a) Show that $* a; converges if and only if 5+ b;, converges. 


(b) Suppose that 3>7°, ag = 100 and that b, = aj, +k? for 1 < k < 42. 
Find 7°, be. 


10. Let $> a, be a series with positive terms. Show that if 5+ a, converges, 
then 5~ az converges, too. (Hint: Compare the series }* a, and S> az.) 


11. In discussing geometric series we invoked the formula 


— rn 
atar tar? + ar. t ar"? = a5 " 
—r 


for the partial sums of a geometric series with r 4 1. Prove by induction 
that this formula holds for all positive integers n. 


2.6 Series 102: Testing for Convergence 
and Estimating Limits 


The preceding section was about definitions and basic ideas. Now we proceed 
to deeper investigation of the two main questions about any given series: Does it 
converge? If so, to what limit? 

We start by expanding our catalog of “known” series. 


2.6. Series 102: Testing for Convergence and Estimating Limits 


Meet the family: p-series. A p-series has the form 
‘s oe 14 3 1 Lh Di 
iP Pp + opt 3p 5p 


where p is any fixed number. With p = 1 we have the harmonic series, which 
we’ve shown to diverge. With p = 2 we get 


“1 1 1 1 1 
ead ptgtgptpt 
k=1 


which converges, as we'll prove in two different ways. The general story on 
convergence and divergence is simple: 


Proposition 2.27. The p-series )- >, ~ converges if p > 1 and diverges ifp < 1. 


Notes toward a proof. For p < 1, divergence follows by comparison to the har- 
monic series: For all integers k > 1, we have 


kP <k, andso > 


al 


1 
kp 
Because the harmonic series diverges, so must every p-series with p > 1. 

Showing convergence for p > 1 takes a little more effort. We illustrate the 
argument for p = 1.3 and leave a general proof to the exercises. 

As with any positive series, convergence occurs if the partial sum sequence 
{S;,} is bounded above. “Our” proof of this uses some clever algebraic book- 
keeping to show that the subsequence 


Si, S3, Sz, Sis, S31, sey Son_1, eee 


is bounded above; this implies, in turn, that the entire sequence {.5;, } is bounded 
above. The idea is to group summands into blocks of length 1, 2, 4, 8, ..., and 
then estimate the contribution of each block. We illustrate this for S3,. The 
inequality holds because the first term in each block is largest, and the last equality 
comes from counting the terms in each block: 


a 1 1 1 1 at 1 1 
S31 Sltastgstast--tastgat- ty tiga tt as 
a 

1 1 1 1 i 1 1 1 1 

= Tors ' 913 ' qs 7 tara t+ gra t+ + gra tT y6gi3 1613 
a 

8 16 1 1 il 
=1+ fF ae smo =lt+astamst + 


It gets easier when we know 
about integrals. 


It’s probably centuries old. 


Do you see why? 
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Mathematica says Sio90 & 3.51 
and Sooo9 + 3.59; the numerical 
evidence is reassuring but only 
anecdotal. 
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Here comes the punchline: The last quantity can be rewritten in geometric form: 


1 1 1 1 
p03 t goat g03 7 [603 


1 ey ‘Ae ae 
=I+s0at\|o3}] +\ 503) + | o03 


le) 1 k-1 le) 1 
a ew .81225"-1 = —__-___ = 5.326. 
<> (as) 208 : [oases 


S'31 <1+ 


Now we are getting somewhere. We’ve shown, by painstaking calculation, that 
S31, and (by the same argument) all partial sums Sgn_1, are bounded above by 
the sum—about 5.326—of a convergent geometric series. This completes the 
proof for p = 1.3; other exponents p > 1 can be handled similarly. 


Sharper Tests for Convergence 


Now that we understand the convergence behavior for two important families— 
geometric and p-series—we can study convergence and divergence more effec- 
tively. First we need some more discriminating convergence tests. 


Limit comparison test. The ordinary comparison test (Theorem 2.25, page 112) 
is simple and powerful, but can be hard to apply. The limit comparison test can 
simplify the work. 


Theorem 2.28 (Limit comparison test). Consider two series )~ ay, and )> by with 
positive terms, and suppose that {ax,/b,} converges to a finite limit L. 


(a) If >> by converges, then )~ ax, converges, too. 
(b) If L £0, then either both \~ ay, and )~ by converge or both diverge. 


Before proving the theorem, let’s try it out. 


EXAMPLE 1. We showed in Example 2, page 107, that the series )77-., by = 
Sor, L/(k? + k) converges, with sum 1. Use limit comparison to deduce that 
some other series converge. 


SOLUTION. We have already shown that the p-series }> a, = S~>1/k? con- 
verges, but the proof was laborious. Limit comparison is easier. Here is the key 
limit: 

A 8 ie . ke+k 


=> im 


ak 
=) im ——<—_—_—$?— 
k-00 by k-o0o 1/(k? + k) k-00 k2 


1 
= limil+-=1 
peas ae , 


which implies (again) that )> 1/k? converges. 


2.6. Series 102: Testing for Convergence and Estimating Limits 


Similar arguments apply to uglier series. If, say, 


= k? + 7k — 3sink 
y=) 


then a look at powers of k in the numerator and denominator suggests limit com- 
parison with )>1/k?. The strategy turns out to work: 


? ant fon Rt TA? = 3k? sin k 
fesse @ ae eee 2k4 —k 
= 14+ 7/k—3/k?-sink = i 
k—-o00 2- 1/k3 2 
Because the limit is finite, the ugly series converges. © 


We'll prove Theorem 2.28 using two auxiliary facts, each of interest in its own 
right. 


Proposition 2.29. Suppose ay, = by for all k > N, where N is any constant. 
Then either both )~ ay, and )~ by, converge (not necessarily to the same limits) or 
both diverge. 


Proof (sketch): The trick is to compare the partial sum sequences {A,,} and {B, }, 
which turn out to resemble each other. Here is the idea for N = 42; a general 
proof is almost identical. If a, = by, for k > 42, and we know n > 42, then 


An = 41 ++++ + G42 + Gag ++++ + On 
= A, +--+ +42 + bag +--+ + dn 
= (a1 + +++ + Gaz) — (br + +++ + baz) + (b1 + +++ + baz) + bag + +++ + bn 


Thus, A, = Ago — Bag + By for n > 42, which implies that the sequences 
{A,,} and {B,,} either both converge or both diverge. If, in fact, B, — B, then 
An — Aaa — Bag + B. 


Proposition 2.30 (Bounded comparison test). Consider two series )~ ax, and 
>° by with positive terms, and suppose that the sequence {aj,/b,} is bounded 
above. 


(a) If )> by converges, then > ax, converges, too. 


(b) If 3} ax, diverges, then )~ by, diverges, too. 
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Note the algebra with sequence 
limits. 
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The second series is a constant 
multiple of the first. 


Sorting out the a’s and b's is a 
little confusing ... but it all works 
out. 
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Proof: The hypothesis means that, for some positive number M, we have a, < 
Mb, for all k. Now if 5+ b, converges, then 5* 1/b;, also converges. By the 
(ordinary) comparison test, )> a, must also converge, as claimed in (a). Part (b) 
is the contrapositive of (a). 


The limit comparison test is a special case of Proposition 2.30, so the proof is 
easy. 


Proof (of Theorem 2.28): Because {ax,/b,} converges, it is also bounded above, 
and so Proposition 2.30 implies part (a). 

Now suppose, as in (b), that L 4 0. In this case {b;/a,} converges to 1/L, 
and we can apply (a) to conclude that if > a, converges, 5+ bj, must do so, too. 
This establishes the claim in (b) about convergence; the statement about diver- 
gence is equivalent. 


The ratio test. For a geometric series, the ratio of successive terms is a constant r, 
and convergence depends on the magnitude of r. The ratio test generalizes this 
principle to a large class of series that, although not geometric, behave “in the 
limit” something like geometric series. The theorem makes these ideas precise. 


Theorem 2.31 (Ratio test). Consider a series )~ ay with positive terms, and sup- 
pose that the sequence {ax41/ax} converges to a finite limit L. 


(a) If L < 1 then > ax, converges. 
(b) If L > 1 then > ax diverges. 
(c) If L = 1 the test is inconclusive; )~ ay, may converge or diverge. 


Let’s try it out. 


EXAMPLE 2. What does the ratio test say about 


ia ede he ke -yrty 
1° 2 6 24 - kl 


What does it say about the harmonic series? 


SOLUTION. The ratio test says that the first series converges. The ratio in ques- 
tion collapses nicely: 


Qp41 1 /(k +1)! k! WD Sikes k 1 


a, —iL/kt Gee [eevee k-(k+1) k+l 


The last quantity converges to zero as k — oo, so the ratio test guarantees con- 
vergence. 


2.6. Series 102: Testing for Convergence and Estimating Limits 


The limit-of-ratios calculation is just as easy for the harmonic series: 


i(k+1) ik 


a 
etn t/etl)_ Fk 


Ak 1/k = =k+1 


ask — o. 


Because the limit is one, however, the ratio test says nothing. The wasted work 
is annoying, but only mildly so: we knew already from other arguments that the 
harmonic series diverges. ©) 


Proof (of the ratio test): We prove (a) and leave (b) and (c) to the exercises. Given 
L < 1, as in (a), choose any real number r with L < r < 1. Then the geometric 
series 1+r+r?+... converges, and we’ll use “bounded comparison” (Propo- 
sition 2.30) to show that )* a; converges, too. 

Doing so requires a basic observation about limits and an algebraic trick. The 
observation is that, because a441/a, — Land L <r, there must be a number NV 


such that 
Qk4+1 


ak 


whenever k> N. 


<7, 


To complete the proof, using Proposition 2.30, we’ll show that the sequence 
{ax/r*} is bounded. To do this it is enough to assume k > N. In this case, 
we can write (here comes the algebraic trick) 


Gk Ak GAk-1 Ak-2 aNnN-1 aN 
ve Qk-1 Ak-2 Ak-3 an 
an pk-N an an 
KOT es LS = Se 
eoi~oOoo” CUT r T 


k — N factors 


Now the last quantity is constant (independent of k), and so the sequence is 
bounded, as desired. 


Not-necessarily-positive series. Most of our convergence tests require positive 
series, and so don’t apply directly to series like 


co . . . . . 
sink sinl  sin2- sin3_ sin4 


ge ay ge eS PG 
k=1 


and 
3 (—1)*-1 1 1 1 1 1 


which have both positive and negative terms. What can we do? 

We’ ve already seen one work-around: test ~ |a;,|, not 5> ax, for convergence. 
Theorem 2.26, page 113, guarantees that if the former converges, so does the 
latter. Indeed, we already used this method (Example 5, page 112) to show that 
the first series converges. 


The ratio test wasn’t the right 
tool for this job. 


Plenty of choices exist. 


The remaining terms, with 
k <N, form a finite set. 
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It's a p-series with p = 1/2. 


A series that starts with 

—a; + €@2—...needs a tiny 
change in notation, but no new 
ideas. 
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This approach fails for the second series, however, because 


» 


k=1 


(—1)*-1 


1) De pet ie tc 


“A RB 


which diverges. We need another strategy. 

The series S>(—1)*~!/Vk resembles the alternating harmonic series 
>>(—1)*"1/k, which we’ve shown to converge. (See Example 6, page 115.) 
The former series turns out to converge, too, and for essentially the same reasons. 
The following theorem applies to these and to other alternating series—ones for 
which successive terms alternate in sign. It is convenient to write such a series in 
the form 


[oe) 
So(-1)* Tax =@,—a2+a3-..., 


where az > 0 for all k. 


Theorem 2.32 (Alternating series test). Let >? ,(—1)*~! a, be an alternating 
series such that 


(i) the sequence {ax} is positive and decreasing: a, > a2 > a3 >--+- > 0; 
(ii) ax ~ 0ask > ow. 


Then the series converges, and the limit S lies between any two successive partial 
sums Sy, and Sy+1. 


EXAMPLE 3. Finish off the series )>7°_, (—1)*-1/Vk. 


SOLUTION. Conditions (i) and (ii) of the theorem hold for our series: 


: > : > : > > : 0 
— — — ee —_ -> 
V1 V2" V3 VEO 
as needed. Thus our series converges to some limit S, which lies between any two 
successive partial sums. For instance, we have 


Sio0 & 0.55502 < S' < 0.65453 & S101 


and 
Sio02 & 0.58911 < S' < 0.62068 & S993. 0) 


2.6. Series 102: Testing for Convergence and Estimating Limits 


Proof (of Theorem 2.32): The proof depends on an nice pattern among the partial 
sums: 

So < 84 <S6 < Sg <---< $7 < S5 < $3 < S}. 
All of these inequalities hold because the terms alternate in sign but decrease in 
size. The reasons are straightforward but pretty, and best thought through for 
oneself. 


In particular, the odd-index subsequence S},53,55,... is decreasing and 
bounded below (by any even-index term, such as Sg), while the even-index sub- 
sequence $2, .54,56,... is increasing and bounded above. Hence both of these 


subsequences converge. They tend to the same limit, moreover, because 
Son — Sox—-1 = A2k, 


and the right side tends to zero as n tends to infinity. This implies, finally, that 
{S,,} itself converges, as desired, and that the limit, S, satisfies the main ine- 
quality: 


Sa Sy Se RS y se oe Oe ey S BES aX Sis 


Estimating Limits 


We have now acquired and sharpened several tools for detecting whether a lot of 
series converge or diverge. That’s nice, but what about the limits themselves? The 
bad news is that exact limits may be hard or impossible to find. (The family of 
geometric series is a rare but important exception.) The good news is that the same 
tools, cleverly employed, can also help us estimate limits with good precision. 


EXAMPLE 4. We used the ratio test in Example 2 to show that the series 


ee eae eee ee 
ee od Ae Oke ag 


converges. What can we say about a limit? 


SOLUTION. Listing the first few (rounded) partial sums S’,, 
1, 2, 2.5, 2.66667, 2.70833, 2.71667, 2.71806, 2.71825, 2.71828, 2.71828, 


suggests a limit somewhere near Si9 * 2.71828. Could the limit be the number e? 
Can we be sure? The S;, are strictly increasing, after all, so what would prevent, 
say, 5123456789 > 3? How much can the rest of our series 


i 1 i ae | 
mtatimt > Le 


contribute, at most? 


Start small; see, e.g., why 
S5 > Sz. 
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Says Theorem 2.26, page 113. 


The last inequality comes from 
comparison to the geometric 
series. 
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The comparison test idea gives a quick (if slightly crude) answer. Since 


we can compare with a geometric series, for which we do know a limit: 


1 1 1 1 1 1 2B ad wane 
i fl dg es om pie ee se 


This tells us that Sj9 + 2.71828 underestimates the true limit by less than 0.002, 
so we can have confidence in (at least) the first three digits. © 


Bounding the tail: lessons from Example 4. For any convergent series )* ay, 
and any partial sum S,, we can write 


Co 
So ax = (41 +02 +++>+4n) + (Qnti t+ Qngat...)=Sn+ Rn, 
k=1 


where R, = byosieer Gx is the nth upper tail, or remainder. 

The idea illustrated in Example 4 is to bound the tail: If we can somehow 
show that |R,,| is small, then the limit S = S,, + R, must be close to the finite 
sum S;,, which we can compute without any worries over convergence. Upper 
tails are often bounded by comparison to suitable geometric series, for which 
(atypically) we can find exact limits. 

We should confess, finally, that the exact limit of the series in Example 4 is 
well known to be e © 2.71828182846. A rigorous proof requires methods we 
haven’t developed; our point here is that reasonable accuracy is available using 
basic arguments. 


EXAMPLE 5. An alternating version of the series in Example 4 looks like this: 


i. it Aa Al i ay a 
(ft Sp a hh a ‘ 
1 Oe On. og d k! 


What can we say about a limit? 


SOLUTION. First, there is a limit. We showed in Example 2 that our series 
converges absolutely, and so it also converges in the ordinary sense. Second, our 
tail estimate from Example 4 can be recycled here: 


fe pode, <a ee ee 1 
Hap aan eee = aa aa ae ee 


|Riol = |- Ty 121-73! 12! ' 13! 99° 
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For the present series we find Si9 ~ 0.367879, and we conclude that the exact 
limit lies somewhere within about 0.002 of Sg. 

But we can do better—with less work—using Theorem 2.32, which guaran- 
tees that the exact limit lies between any two successive partial sums. Calculating 
So & .3678792 and Sig % 0.3678795 gives us about six decimal places of accu- 
racy. Again, our estimate plays well with the exact limit, which can be shown by 
other methods to be 1/e * 0.367879441. © 


Exercises 


1. Use methods of this section to prove as efficiently as possible that each 
series converges or diverges. (These same series appeared in exercises for 
the preceding section; now we know more techniques.) 


ae 
@ > ae 
ram k+2 
ok 
b oe 
©) > oe 
k=1 
(c) », 3k (Hint: Compare to a geometric series.) 
k? +2 
@ Se BR +a 
2. Prove or disprove that each of the following series converges. 
alte 
(a) aS: 
sink 
b ees 
() = 1.0001* 


CO 


ke+k+1 
(©) > BR +R+I 


(d) a (Gr 
regs In(k + 1) 
3. Throughout this problem, let 5> a;, be a series with positive terms. 


(a) Show that if 5> a, converges, then >> az converges, too. (Hint: Use 
limit comparison.) 
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(b) Give examples to show that if 5 a, diverges, then )* a? may either 
converge or diverge. 


. In the situation and notation of Proposition 2.28, page 120, show that if 


>> by diverges and a;,/b;, — co, then )~ ay, diverges, too. (Hint: If ay, /bp 
oo, then by,/az, — 0.) 


. Complete and prove the following version of Proposition 2.29, page 121: 


Consider two series 5> ax, and )> by such that aj, = by fork > N. If > by 
diverges, then > a, diverges, too. If }> b, converges to B, then S~ az 
converges to : 


. In Example 2, page 122, we used the ratio test to show that arate 1/k! 


converges. Prove the same thing using the comparison or limit comparison 
test. 


. Let {.S,,} be the partial sum sequence for the series )77~ , Tr Show that 


Sin > s/n for all n; conclude that the series diverges. 


. In our proof sketch for Proposition 2.27, page 119, we showed that )> 1/k? 


converges if p = 1.3. Show in a similar way that }>1/k? converges for 
every p > 1. (Hint: Writing p = 1 + s simplifies the algebra slightly; then 
s plays the role of 0.3 in the given proof.) 


. Consider the series )>7° , 1/(2*~! + 1), which converges to some limit S. 


(a) Explain why the series converges. 


(b) Maple says that that the tenth partial sum is S19 ~ 1.26255. Give an 
upper bound for fj, the corresponding upper tail. 


(c) Use results of the previous part to give a small interval guaranteed to 
contain S. 


. Consider the series )>P°_,(—1)*~1(1/(2*~! + 1)) (the alternating version 


of the series in the preceding problem). 


(a) Explain why this series converges to some limit S. 


(b) Find an interval of length less than 0.001 that is guaranteed to con- 
tain S. 


(c) By how much can the partial sum S109 differ from the S’?? Why? 


. For any p-series 7°, = with p > 0, it can be shown that all upper tails 


satisfy 


Co 


2.7. Lim sup and lim inf: A Guided Discovery 


(a) Draw a picture to illustrate this inequality. (Leaf through any standard 
calculus text, if necessary.) 


(b) Estimate ree a with error less than 0.001. (Technology will be 
helpful.) 


2.7 Lim sup and lim inf: A Guided Discovery 


This brief section, designed for self-study, explores two useful generalizations of 
the “ordinary” limit of a sequence: the limit superior and the limit inferior (also 
known as the upper limit and the lower limit, respectively). 


Defining the lim sup. Given any bounded sequence {::,,}, we can define the new 
sequence {v,, } 


U1 = sup{%1, £2, %3,...}; v2 = sup{©2, 23, 04,...}; 
Un = sup{@n, 2n41,2n42,---F- 


(We’ll call this the sup-sequence associated to {x,}.) 
Now we can give the definition: 


limsup zy, = lim vy. 
n— co 
In brief: lim sup v7, = limp... sup {xz | k > n}. 


Basic problems. 


1. For each of the following sequences {,, }, find the corresponding sequence 
{vy}; then find lim sup ay. 


@OHen} S41,t0 


4 


pee 


i. 
(b) {ta} = {25g qege b 
wt fd 34-4.) 
eerie 


2. Why must all the vu; exist in the sup-sequence? Why must {v,,} have a 
limit? (Cite appropriate theorems.) 


3. How are lim, and limsup z,, related to each other for a bounded se- 
quence {2,,}? Can one exist but not the other? Can both exist but be 
different? 
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Results of this section are not 
needed in later sections of the 
book. 


We will discuss unbounded 
sequences below. 


2. Sequences and Series 


The lim inf. The lim inf is closely analogous to the lim sup. 


1. Carefully state an appropriate definition for the lim inf of a bounded se- 
quence. Then find lim inf x, for each of the example sequences in the first 
problem. 


2. Find a sequence {x,,} with limsup x, = 5 and liminf x, = —2. 
3. Prove that if limsup x, = 5 and liminf x, = 5, then lim x, = 5, too. 


4. It seems reasonable that liminf x, < limsup z,, for any bounded sequence 
{x,,}. Prove this. What happens if the two are equal? 


Algebra with the lim inf and the lim sup. The ordinary limit has nice algebraic 
properties: for example, lim (2, + y,) = lima, + lim y, (assuming that both 
limits on the right side exist). Do lim inf and lim sup have similar properties? 


1. Suppose lim sup z,, = S, liminf xz, = J, and k is a constant. What can 
be said about lim sup kx, and liminf kx,,? (Hints: Experiment with your 
example from above, and with several possible values of k.) 


2. Iflim sup z, = Land lim sup y, = M, what can be said about lim sup(a, 
+ Yn)? (Show that lim sup(ap, + yn) < L+M.) 


Unbounded sequences. If {2,,} is unbounded above, then the sup-sequence 
{v,} doesn’t exist, but it is natural to define lim sup x, = oo. Similarly, if {x,} 
is unbounded below, then we define lim inf x, = —oo. 


1. Find the lim sup and the lim inf of the sequence —1, —2, —3, —4,.... Find 
a sequence {x,,} with lim sup #,, = co and liminf x, = —oo. 


2. Show that every sequence (bounded or not) has a lim sup and a lim inf. 


3. If limsupz, = +00, must limz, = +00, too? If limsupx, = —oo, 
must lim x, = —ov, too? 


Lim sups, lim infs, and subsequences. The limsup and the liminf of a se- 
quence {x,,} are closely connected to subsequences of {x,,}, as the following 
proposition suggests. 


Proposition 2.33. Let {x} be a sequence. 


(a) If {xn} is unbounded above (so limsup £, = 00), then some subsequence 
{2n,,} diverges to infinity. 


(b) If {a} is bounded above, with limsup %, = L, then some subsequence 
{@n, } converges to L. 


2.7. Lim sup and lim inf: A Guided Discovery 


Proof: Claim (a) is essentially Lemma 2.13, page 96. 
To prove claim (b), let’s construct a subsequence {x,,, } such that, for all k, 


L- < Un, < Vp = sup{Lp, Le41,---; 
where {v;} is the usual sup-sequence. Now the sequences at left and right both 
clearly converge to L, so our subsequence {2»,, } is “squeezed” to the same limit. 

To construct the desired subsequence, observe first that D -1 < DL < uv. 
Thus Z — 1 is not an upper bound for {x1,22,23,...}, so we can find xp, 
with D—1 < ap, < vi, as desired. Similarly, LZ — 1/2 is not an upper 
bound for {2 41, 2n,+2;2n,+3,---}, So there is some x, with ng > ny, and 
L—1/2 < an. < Un, < v2, as desired. Continuing this process produces the 
desired subsequence {%n,,2n.,;Ung,--- }. 


Use these ideas in the following problems. 


1. Show that if lim sup x, = 5, then there is a subsequence {,,, } that con- 
verges to 5. Could some other subsequence converge to 6? To 4? Explain 
your answers. 


2. Show that if {x,, } has a subsequence {2,,, } that converges to 5, then 


liminf t, <5 < limsup7z,. 


3. Show that if limsupz, = ov, then there is a subsequence {2,,, } with 


In, 7 &. 


More on the “sup-sequence.” The following problems explore further the cor- 
respondence between the original sequence {x,,} and the “sup-sequence” {v,, }. 
We can ask, for instance, about “inverting” this correspondence. 


1. For each of the following sequences {v,,}, find (if possible) two different 
sequences {,,} that give the same sup-sequence {v,, }. 


3344 
(a) {Un} —_ {2.23 9°37 3?" 4 
(b) {Un} = {2, 2,2, 2,2, 2, on aa 
(c) {vp} = {2, 1.1, 1.01, 1.001, 1.0001,...} 


2. For which sequences {v,,} can there be more than one associated sequence 
{x,,}? State and prove a claim. 


Limits and Continuity 


3.1 Limits of Functions 


This chapter is about continuous functions and their properties. As a foretaste, 
consider this reasonable-sounding claim about a function /: 


If f(—1) = —2 and f(1) = 5, then f(a) = 0 for some a € (—1, 1). 


Is this true? If a graph passes through the points (—1, —2) and (1,3), must it 
also cross the x-axis somewhere in between? Must there also be an input b with 
f(b) =7? 

Both answers turn out to be “yes”—but only if the function f is continuous 
everywhere along the input interval [—1, 1]. In this case, answers follow from the 
famous intermediate value theorem, which we will carefully state and prove later 
in this chapter. To prove such results we'll draw on earlier work with complete- 
ness, suprema and infima, limits, the Bolzano—Weierstrass theorem, and more. 
Our work starts, as usual, with clear definitions of words like “limit” (as applied 
to functions) and “continuous.” 


Defining Limits of Functions 


Most of us emerge from elementary calculus courses with a rough and ready view 
of function limits. To say that lim,_.3 f(z) = 5, for example, means something 
like 


f(x) approaches 5 as x approaches 3, or f(a) 5 when x 3. 


These views are not incorrect, but they are too vague to be useful for building 
theory or proving theorems. What exactly does “approaches” mean? How close 
to 3 must x be for x & 3 to hold? We need a precise definition. 


Definition 3.1 (Limit of a function). Let f be a function whose domain includes 
an open interval J containing a, except perhaps for x = a. Let L be a number. 
We write lim, ., f(x) = L if, for every € > 0, there exists 6 > 0 such that 


| f(a) — L| <e whenever xz € I and0 < |x —a| <6. 
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3. Limits and Continuity 


Observe, right away, two important subtleties about domains: 


© Atx =a: The value f(a) itself need not be defined for the limit to exist at 
x = a. Indeed, the whole point of finding such a limit is often to do with a 
“missing” or mysterious function value. If f(a) is defined, that’s fine, but 
it’s irrelevant to the definition, which avoids all mention of f(a). 


° Near x =a: For the key e—d condition to make good sense-let alone hold 
true—f(a) must be defined for all x (other than a) that are within 6 of 
a. We will sometimes summarize these conditions by requiring that the 
domain of f include some set of the form J \ {a}, where J is any open 
interval containing a. 


EXAMPLE 1. Let f(x) = (3x? — 3)/(x — 1). Use Definition 3.1 to show that 
lim,—1 f(a) = 6. 


SOLUTION. Note first that all is well with domains: f (1) is undefined, but f(z) 
makes good sense for all other x. Next, for  # 1 we have 
327-3 3(a—-1 1 
f(a) = 28 — OE) XL sets, 
x—1 z—1 
and so 
| f(a) — 6| = [3a +3 — 6] = [82-3] =3|x—-1]. 


Thus, 
38la-1Ij<e = |ax-1/< 7 
This shows that 6 = €/3 “works” for any given ¢, and we’re ready for the brief 
formal proof. 
Let € > 0 be given, and set 6 = €/3. This 6 works, because if 0 < |z—1| < 6, 
then 
327 — 3 
-1 


| f(a) — 6] = -6| =3|2-1] <3 =< 


where we used factoring in the second equality. 
Note that the preceding paragraph stands on its own as a proof—the discussion 
beforehand is there “only” for motivation. ©) 


EXAMPLE 2. Let aand k be any constants. It is no surprise that 


lima=a and limk=k, 
rw—a 2 


but how do these facts follow from Definition 3.1? 


3.1. Limits of Functions 


(a) A good 6 (b) A bad 6 


Figure 3.1. Picturing function limits. 


SOLUTION. With f(a) = a for the first limit f(2) = & for the second, verifying 
Definition 3.1 is an easy exercise. >) 


Picturing limits. Function limits can be understood graphically. For any given € 
and 6, we can look at the tiny rectangular “window” in the xy-plane, centered at 
(a, L), in which D—e€ < y < L+eanda—06 <x <a+0. This window has 
“half-height” € and “half-width” 6; two possibilities are shown in Figure 3.1. 

From this viewpoint, the «—d condition is satisfied if the graph of f stays 
inside this rectangle all the way from left to right, as in Figure 3.1(a). If the graph 
“escapes” at top or bottom, as in Figure 3.1(b), the chosen 6 does not work for the 
given e. The limit is L if, for any choice of ¢ (the half-height), there exists some 
good half-width 6. 


EXAMPLE 3. The function 
1 ifxEeQ 
f(x) = ; 
0 if ¢gQ 


is called the characteristic function of the set Q. Does limz_,o f(x) exist? 


SOLUTION. The (unsurprising) answer is no. To prove this formally, suppose 
toward contradiction that L is a limit, and set « = 0.001. By our assumption, 
there must be some 6 > 0 with 


|f(%) — L| <0.001 whenever x € (—d, 0). 


But (as we’ve shown) every interval (—0, 6) contains both rational and irrational 
numbers. If s is such a rational and ¢ an irrational, then both 


|f(s) — L| =|1—L| <0.001 and |f(t)— L| = |0— L| < 0.001, 
which is clearly absurd. ©) 
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It is useful in illustrating “bad” 
behavior; we'll see it again. Can 
you picture the graph? 
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Limits respect sums, products, 
quotients, etc. 


Let’s prove the contrapositive, in 
other words. 


3. Limits and Continuity 


The next example illustrates what can—and cannot—be inferred from the ex- 
istence of a limit. 


EXAMPLE 4. Let f be a function for which lim,_,, f(z) > 0. Must f(x) > 0 
also hold for x near a? What can be said if lim,_., g(x) = 0? 


SOLUTION. Concerning f, the answer is yes—except at x = a, where we know 
nothing. To see why, suppose lim, f(x) = L > 0, and sete = L/2. By 
definition of the limit, there is some 6 > 0 that “works” for this € in the sense that 


L 
|f(z) -Ll < 5 whenever 0 < |x —al| <0. 
Rewriting these inequalities gives 


L 
f(z) > 5 >0 whenever 2 €(a—06,a+6) and «a, 


as desired. 

Concerning g, the answer is “not much.” Near x = a the function g could be 
positive (if, say, g(a) = (a — a)?) or negative (if, say, g(x) = —(x — a)?). Or 
perhaps g(a) = x — a, which changes sign at x = a. 


Basic Properties of Limits 


Limits of functions have a lot in common with limits of sequences. This is hardly 
surprising—sequences are functions, after all—but it’s a good thing, because 
most of our hard work with sequence limits will transfer readily to function limits. 
It is easy to prove, for instance, that function limits have the same nice algebraic 
properties that sequence limits enjoy. The following technical lemma helps clarify 
the connection. 


Lemma 3.2. Let f(x) be defined on I \ {a}, where I is an open interval contain- 
ing a, and let L be anumber. The following are equivalent: 


(i) liMnsoo f(tn) = L for every sequence {x} C I \ {a} with a, — a. 
(ii) lime—a f(x) = L 


Proof: To show that (i) implies (ii), let’s assume that (ii) fails and construct a 
sequence {x,,} that violates (i). 

For (ii) to fail means that there is some positive €, say €9, for which no 6 > 0 
works. In particular, 6 = 1 fails, so there must be some «x in I \ {a} such that 


O<|a,—al<1 but |f(a)-—L|> oe. 


3.1. Limits of Functions 


Because 6 = 1/2 also fails, there is some x2 with 
0<|zxg-al<1/2 but |f(r2)-L| >. 


Continuing in this way, we construct a sequence {x,,} of points in J \ {a} such 
that, for each n, 


1 
0<|a,—-al<— but |f(r,)— L| > eo. 
n 


In particular, { f(x,,)} does not converge to L. On the other hand, {x,,} converges 
to a; the squeeze principle assures this because 


1 1 
a-—<&%<a+-—- 
n n 
for all n. Thus our sequence {2,,} violates (i), as desired, and we’ve shown that 


(Gi) => (i). 
Showing (11) == (i) is easier; we’ll leave it as an exercise. 


Lemma 3.2 lets us exploit earlier work with sequences to find, without much 
effort, a lot of function limits that would be tedious or difficult to handle from 
scratch. 


EXAMPLE 5. Let f(x) = (3a? + 52 + 2)/(2x — 7). Use Lemma 3.2 to prove 
that lim, 5 f(x) = +. 


SOLUTION. Proving such a thing straight from the e— definition could get ugly. 
With Lemma 3.2, it is easy. 

To get started, let {x,,} be any sequence with x, — 5. By Lemma 3.2, it is 
enough to show that 


322 +5a,+2 102 
lim f(x,) = lim aie ae Se 
n—0o n—00 22n —7 3 
This, in turn, follows from different parts of Theorem 2.5, page 83, which allows 
algebraic combinations like the following for limits of sequences: 


i 322 +5an+2 3(limz,)? + 5(limz,)+2 
im eee ee ee 


_3-57+5-542 102 
a: ne: ae 


as claimed. ©) 
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No worries here about zero 
denominators. 
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We addressed these in 
Example 2. 


3. Limits and Continuity 


New function limits from old. The following theorem does for functions what 
Theorem 2.5, page 83, does for sequences. 


Theorem 3.3 (Algebra with limits). Let f(a) and g(a) be defined for all inputs « 
in I \ {a}, where I is any open interval containing a, and suppose that 


lim f(z)=L and lim g(x) = M. 


Then 
lim (f(c) £ 9(2)) = lim f(x) + lim g(2) = L+ M: 
lim (f(z) - g(2)) = lim f(z) - tim g(x) = L- M; 
igh f (2) = limg—a f(x) ; 
ria g(a) limpoag(z) 97% 
Observe: 


° Existence: Theorem 3.3 is more than a recipe for calculating new limits 
from old; it is also a guarantee that various limits (those on the left of each 
equation) actually exist. In the case of quotients, for existence, one might 
reasonably worry about small or vanishing denominators; Theorem 3.3 as- 
sures us that all is well if M 4 0. 


Something to combine: The how-to-combine-limits rules in Theorem 3.3 
aren’t much good until we have some already-proved basic limits to com- 
bine. The good news is that a few very, very basic limits, such as 


limz=a and limk=k 


go a long way. Applying Theorem 3.3, repeatedly as necessary, lets us cal- 
culate limits like these without too much effort: 


327+ 5¢+2 3-5°+5-54+2 


li i eee Ss 4: 

pa eee at 
31702-5242 317-52 — 5842 aie 

ra LO ao «ADO ae 


(We calculated the first limit, a bit differently, in Example 5.) 


Proof: All parts of Theorem 3.3 follow easily when we combine the analogous 
results for sequences (Theorem 2.5, page 83) with Lemma 3.2, above. Concerning 
products, for instance, we consider any sequence {,,} in I \ {a}. By hypothesis, 
the new sequences {f(z,)} and {g(a,)} converge to L and M, respectively. 
By Theorem 2.5, the product sequence { f(an)g(an)} converges to LM, and 
Lemma 3.2 assures us that lim; 4 f(2)g(xz) = LM, too. The remaining parts 
are similar. 


3.1. Limits of Functions 


Squeezing functions. A squeeze principle works for function limits. We’ve al- 
ready proved the sequence version (Theorem 2.6, page 86): 


Proposition 3.4 (The squeeze principle). Let f(x), g(x), and h(x) be defined 
for all inputs x in I \ {a}, where I is an open interval containing a, and suppose 
that 

f(z) < g(a) < h(a) forallx € I \ {a}. 
If limy—sa f(x) = Land lim, , h(x) = L, then lima g(x) = L, too. 


The proof, like that of Theorem 3.3, amounts to combining the sequence version 
with Lemma 3.2. We’ll leave that to the exercises. 
The limit-squeezing idea is simple. The tricky bit in practice is to find helpful 
squeezing inequalities. 


EXAMPLE 6. Squeeze something to find limy_,9 x sin(1/2) and lim; . 


x 


SOLUTION. The fact that |” sin(1/x)| < || for all 2 # 0 suggests a simple 


squeezing inequality: 
1 
— |r| < x sin (=) < |e]. 
x 


Clearly, +|x| — 0 as x — 0, so the middle quantity tends to zero, too. 
Finding a good squeezing inequality for the second limit takes more effort. 
Here is one possibility: 


sin x 
cosx < 


ss 


if —1,1 d 0. 
aaa ifee(—-1,1) and «F 


This does the job: Since cosx — 1 as x — 0, the left- and right-hand functions 
tend to 1, and hence so does the middle function. The squeezing inequality needs 
proof too, of course; we’ll leave that to the exercises to avoid distraction. © 


Beyond Vanilla: More Limit Varieties 


We’ ll find uses for several variations on the basic limit theme discussed above. 
Here are some samples of limits at infinity and infinite limits: 


4x +3 1 
as =2; lim >=; lim (x? — x) = 00. 
roo Qa + x20 2X 2-00 


Here are some one-sided limits: 
; ; ue . xr—3 
lim Ina =-—oo; lim ——==0; lim ——=- 
xr—0F x ot 1+ Ja L337 |3 _ x| 
Following are several formal definitions; we leave some others, all in the same 


spirit, as exercises. As with Definition 3.1, each part here involves some technical 
assumption about domains, needed to ensure that the key inequalities make sense. 
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Plotting these functions might 
be useful. 


Familiar functions from calculus 
seldom cause trouble on this 
score. 


3. Limits and Continuity 


Definition 3.5 (Variant limits). | Let f be a function and let a and L be real 
numbers. 


Right-hand limit: Let f (a) be defined for all inputs x in some open interval 
I = (a,b). We say lim, _.,+ f(a) = L if, for every « > 0, there exists 
5 > 0 such that 


|f(a) — L| <e whenevera € Janda<a<a+0o. 


Left-hand infinite limit: Let f(x) be defined for all inputs z in some open 
interval JT = (b,a). We say lim,_,,- f(x) = —oo if, for every M > 0, 
there exists 6 > 0 such that 


f(a) <—M_ wheneverz € Tanda—6 <2 <a. 


Limit at infinity: Let f(a) be defined for all inputs x in some open interval 
I = (b,c). We say limz_.0. f(x) = L if, for every « > 0, there exists 
N > 0 such that 


|f(v) —L| <e wheneverz € Jandx> WN. 


Infinite (two-sided) limit: Let f(a) be defined for all inputs x in an open 
interval J containing a, except perhaps at x = a. We say lim,.4 f(a) = 00 
if, for every M > 0, there exists 6 > 0 such that 


f(a) > M_ whenever x € I and0 < |x —al <0. 


Infinite limit at infinity: Let f(a) be defined for all inputs x in some open 
interval I = (b, 00). We say limy_.0. f(x) = 00 if, for every M > 0, there 
exists N > 0 such that 


f(z) > M_ whenever «> N. 


EXAMPLE 7. Prove these limits: 
hos LT .  d@+2 ; Qa 
(a) Jim = = 0; (b) Jim is 3; (c) Buss aoe 0 


SOLUTION. Note first that the functions in all three limits are fine as regards 
domains. The first is defined for x > 0, the second for x > —5, and the third for 
x> 0. 

Limits (a) and (b) have close analogues for sequences—limy,_, 5, 4 = 0 and 
limn—oo (32 + 2) /(n +5) = 3—and the proofs for functions are almost identical 
to those for sequences. For (b), for instance, we proved the sequence version in 
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Example 2, page 75, and the function version is almost the same. First, we see 
that for any positive e, 


3x2 +2 —-3(4+5) 13 


x+5 e+5 


3x2 +2 3 — 
£+5 - 


(It is OK to drop the absolute value because x + 5 > O certainly holds for the 
large positive x in which we’re interested.) Now 
13 
x+5 


13 
<e => —-5<Q, 
€ 


and so for any givene > 0 wecan set M = 13/e. This M works in the appropriate 
definition, because if x > M, then (skipping some details from just above) 


3x +2 ; 187... 43 


25 | eee 


as desired. Limit (a) is left to you. 
For (c), only positive x matter, and so 
ae 0 es <2 < => < = 
— — 0 € x = 
1+ fx 2 


1+ Jz 


Thus, for given « > 0 the value 6 = €/2 works: if 0 < a < 4, then 


22 
14+ Vz 


as the definition requires. ©) 


=o] =< 22 < 25 =<, 


Allin the limit family. All of these limit variants are close kin to each other. Limits 

at infinity, for instance, are one-sided in the sense that oo is approachable only 

from below, and —oo only from above. The following proposition makes some of Similar results apply to limits 
this kinship explicit: that involve —oo. 


Proposition 3.6. Let a and L be finite numbers, and f a function. 

(i) limz—a f(x) = L ifand only if lim,_.,+ f(x) = lim,_,,- f(x) = L. 
(ii) limg—oo f(x) = L if and only if lim,_,9+ f (4) = L. 

soe . fae . . . 1 co 
(iii) limy—oo f(x) = co if and only if limz—o Fay = 0. 


Observe, especially, what the proposition says about existence: if the limits on 
either side of “if and only if” exist, then so must the limits on the other side. 
We will leave formal proofs to the exercises, but illustrate the idea of (i) with an 
example. 
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Here “right” and “left” mean 
xX > 0 and x < 0, respectively. 


3. Limits and Continuity 


Figure 3.2. A piecewise-defined function. 


EXAMPLE 8. Figure 3.2 shows the function 


x/2 ifx <0, 
g(x) = ; 
2xesin(1/x) ifx > 0. 


Show that lim, 0 g(a) = 0. 


SOLUTION. Let € > 0 be given; we need a 6 > 0 that works for «. Because of 
g’s peculiar two-sided nature, we’ll first find positive numbers diem and dright that 
work for € on the left and on the right of zero, respectively. 

Matters are simplest on the left. There g(x) = x/2, and so 


\g(x) — O| = 5 | <e <=  |2| < 2¢, 


which means that die, = 2€ works (and also that lim, _,9- g(x) = 0). On the 
right, we have g(x) = 2a sin(1/2), and so 


lg(z) — 0| = |2rsin(1/z)|<|22)<e <3 |al< = 
which means that dyignt = €/2 works (and that lim,,_.9+ g(x) = 0). 
Combining these results means that |g(2) — 0| < € holds for all nonzero x in 
the asymmetric interval 


(—dteft, Oright) = (—€/2, 2€). 


Obviously, |g(a) — 0| < € also holds for nonzero x in the (smaller) symmetric 
interval (—dier, diet). This means that 6 = die = €/2 works for ¢€ in the desired 
limit. We’re done. ~) 


3.1. Limits of Functions 


More algebra. These newfangled limits—if finite—enjoy all of our by-now- 
familiar algebraic and squeezing properties. Infinite limits, on the other hand, 
need special care. 


EXAMPLE 9. _ Find and prove (or re-prove) these limits: 


1 3 2 
dim, ape Spe ime? 1000; im Va Be a, 


SOLUTION. The first limit yields to squeezing. For all x > 0, we have 


1 


0 < ———- < 
r+ J/e+3 


d . 
x d 
since lim;_.6 + = 0, we must have lim, 45 1/(a + \/z + 3) = 0, too. 
We found the second limit from scratch in Example 7. With algebra we can 
reduce the limit to something simpler (also handled in Example 7): 


3a +2 _ ies 34+2/e  3+2lim1/z a0 


woo E+5  e-col+5/e 1+5limi/e 1+0 


The limit lim,_... 2” — 1000z involves the difference of two quantities, each 
tending to infinity. Which tendency “wins”? It is easy to guess that x? over- 
whelms 1000: for large x, and so, presumably lim,_,.. x? — 10002 = oo. Alge- 
bra helps confirm this. For any given M/ > 0, we have 


x” — 10002 = x(z — 1000) > M_ ifx > Mandz > 1001, 


so we can use N = max M, 1001 in the appropriate definition. Check details for yourself. 
The function Vx? + 27 —- is another difference of two quantities that diverge 
to infinity. Plotting the function or plugging in large values of x suggests that the Try it. 


limit is one. We can show this algebraically—with a little effort. The trick is to 
multiply and divide by the conjugate expression: 


2 = 2 
Y oae rae (Va +22 x) (Va + 2¢ + 2) 
Var2+2r+a 
2x 


2 
fe? 4+2n+a V/14+2/e4+1 


(We divided numerator and denominator by in the last equality.) This is progress: 


to show that lim... Vx? + 2x—a = 1 itremains only to show that \/1 + 2/a > 
las x — oo. We'll leave this reasonable-seeming result to the exercises. ©) 
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Exercises 


1. We said in Example 2 that if a and k are any constants, then 


limaz=a and limk=k. 


r~—a 


Use Definition 3.1 to prove this. 


2. Guess a value for each of the following limits; prove your answers using 
Definition 3.1, page 133. 


(a) lim 22 +3 
2 


ge? — 
b) li 


2 

—2 
© in 
2+ sinz 
3—cosx 


(im, 2 


3. For a function f and an input « = a it may (or may not) happen that 
limz.a f(x) = f(a). If f(~) = 3x+5, for instance, it is easy to show that 
lim, 42 f(~) = 3-424 5 = f(42). (Ata domain endpoint, we’d use the 
appropriate one-sided limit.) 


In each case following, decide whether this happens. If so, prove it; if not, 
say why not. (Theorem 3.3, page 138, may be useful.) 

a) Fear ie A2, 

(b) f(x) = x7; a = any constant. 


2 
© f(a) =L 20-2 
xr—2 
2. 
O@y@= owas 
x—2 


©) f 
(f) f 


(g) f(x) =1+ 72 + ex? + rex?; a = any constant. 


i eee Oe 


x) = |r\|;a=0. 


( 
( 
( 
( 


4. Prove Proposition 3.4. (One way is to use Theorem 2.6, page 86, and 
Lemma 3.2.) 


5. Show that (ii) implies (i) in Lemma 3.2. 
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6. In Example 6, we used the squeezing inequality 


sept ee u 
~ @ ~ cosé 


(We said x, not 0.) Show as follows that this holds for nonzero @ in 
(—1/2, 7/2). 
(a) Explain why it is good enough to show this for 0 € (0, 7/2). 


(b) Give a geometric proof of the (equivalent) inequality sin@cos@ < 
@ < tan@. (Hint: Draw the angle @ into the first quadrant of the 
unit circle in the usual way. Then find right triangles whose areas 
represent the left- and right-hand quantities above. What area does 0 
represent?) 


7. We defined several variations on the basic limit theme in Definition 3.5. 
Here we explore two more members of the family. 
(a) Give a precise definition for the expression lim,,_. f(x) = L. 
(b) Give a precise definition for lim,_,9- g(a) = L. 
(c) Use the preceding parts to show that lim,._. f(x) = L if and only 
if lim, _.9- f(1/x) = L. 
8. This problem explores limit properties of the function f(x) = \/z. 


(a) Let a > 0. Show lim, f(x) = f(a) = a. Use an &-6 proof; the 
inequality 


ar |x — al |x — al 
eel aie ae 


may help. 

(b) Show lim, 9+ f(x) = f(0) = 0. Use an e-6 proof. 

(c) We claimed in Example 9 that J1t+2/zx — 1 (or, equivalently, that 
J1t+2/z —1-— 0) as x — oo. Deduce this by showing that the 
squeezing inequality 0 < J14+2/z —1< 2/z holds for all x > 0. 


9. Use a computer plotting utility in this problem, which explores the graphi- 
cal interpretation of Definition 3.1, page 133. No proofs needed. Through- 
out, let f(a) = x? — 62. 


(a) Assume (it is easy to show, but don’t bother) that lim, f(x) = —5. 
For € = 0.01, find a positive 6 that works, and illustrate your answer 
by sketching the graph of f in a well-chosen window of half-height € 
and half-width 6. 
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Even sharp ones. 
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(b) It is a fact that lim,—100 f(z) = 9400. For « = 0.01, find a positive 6 
that works, and illustrate your answer by sketching the graph of f in 
a well-chosen window of half-height € and half-width 6. 


(c) It is true that lim, 10 f(z) = 40. Set « = 0.01. Does 6 = 0.001 
work in this case? Sketch the graph of f in an appropriate window to 
illustrate your answer. 


. Suppose that f(x) = 0if « ~ 0 and f(0) = 42. Explain carefully why 


lim, f(#) = 0 for all real numbers a. 


. Suppose that g(x) = Oif « ¢ Zand g(n) = 42 if x € Z. Explain carefully 


why limz_.a g(x) = 0 for all real numbers a. 


. Suppose that h(a) = 42 if ¢ = 1,1/2,1/3,... and h(x) = 0 otherwise. 


What can be said about lim, _., h(x)? Why? 


. Let S be a finite set of real numbers. Suppose that j(s) = 42 ifs € S$ 


and j(2) = 0 for all other real x. Show that lim,_., j(a) = 0 for all real 
numbers a. 


Explain why the function g in Example 8, page 142, satisfies the squeez- 
ing inequality —|2z| < g(x) < |2a]. Use this to give another proof that 
lim,—o g(x) = 0. 


3.2 Continuous Functions 


In everyday use the word “continuous” means something like “unbroken” or 
“without gaps.” In elementary calculus, calculus, for instance, a continuous func- 
tion may be thought of as one whose graph can be drawn without lifting the pencil. 
Such views can be useful, but they need sharpening for mathematical purposes. 
Pencils are not mathematical objects; to get started we need to describe continuity 
in mathematical language. Function limits are the key. 


Definition 3.7 (Continuity of a function at a point). Let the function f be defined 
for all inputs x in an open interval J containing a. We say f is continuous at x = a 
if, for every « > 0, there exists 6 > 0 such that 


\f(x) — f(a)| <e€ wheneverx €Jand |xr-al <6. 


Otherwise, f is discontinuous at x = a. 
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Observe: 


© In terms of limits: All those e’s and 6’s suggest a lurking limit. Sure enough, 
the definition boils down to this: 


lim f(e) = f(a). 
In words: The value f(a) is also the limit of f(a) as x approaches a. 


© No domain gap allowed: For f to be continuous at a it is necessary—but 
not sufficient—that lim, _., f(x) exist. Two additional requirements apply: 
(i) the value f(a) must exist; (ii) f(a) must be the limit. 


© What if ais an endpoint? Is the function f(a) = ./x continuous at x = 0? 
The answer will turn out to be yes—but note that the definition above 
doesn’t say either way, because f is not defined on any open interval con- 
taining x = 0. Later we will treat the case where x = a is an endpoint of a 
function’s domain; right now we will stick to the basic version. 


© No surprises: Continuity of f at a means, roughly speaking, that f “springs 
no surprises” at a: The value f(a) can be inferred from values f(x) for x 
near a. In terms of € and 6, the condition is that f(a) stays within € of f(a) 
as long as x stays within 6 of a. 


Let’s see some examples and non-examples. 


: : 2 : 
EXAMPLE 1. Where is the function f(a) = st continuous? Can any 


discontinuities be “fixed”? 


SOLUTION. The live question for any given a is whether 


: . a -—Ar—5 a?—4a-5 
pode aoe en 
Thanks to algebraic properties of limits the answer is yes—except perhaps at 
a = +5, where the denominator vanishes. So f is continuous at all a 4 +5. 
What if a = +5? The short answer is that f is discontinuous at both points, 


because both f(5) and f(—5) are undefined. On the other hand, factoring gives 


z—5)(e@4+1 xr+1 
f(a) — MEAD) _ 
(c—5)\(a+5) «+5 
for « £ +5, from which it is easy to see (or prove, but we won’t bother) that 


; 6 : : 
lim f(x) = i0° but tim. f(x) does not exist. 


This means, in turn, that we can make f continuous at a = 5 by defining f(5) = 
3/5, but that the discontinuity at a = —5 is there to stay. A look at the graph of f 
suggests the same conclusion. © 
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Summarized in Theorem 3.3, 
page 138. 


Plot fto see for yourself; a gap 
might appear at x = 5, 
depending on the technology. 
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Sketch your own. 


Such sequences do exist. 
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0.54 


-1 -0.5 0 0.5 1 


Figure 3.3. A function continuous at only one point. 


EXAMPLE 2. Consider the functions 


-1 ifa<0, ee 
sign(x)= 40 if#=0, and g(x) = fi id , 
1 ife>0, 


Figure 3.3 hints at the graph of g. 


SOLUTION. As its graph suggests, the sign function is discontinuous at a = 0, 
because lim,,_,9 sign(x) does not exist. But the sign function is continuous at all 
a # 0, because (as the graph also suggests) lim,_., sign(x) = +1 = sign(a). 
Formal proofs are straightforward; see the exercises. 

The function g is stranger, and its graph is much weirder than Figure 3.3 
suggests. For example, neither “line” contains any unbroken segment, but each 
“line” contains infinitely many points, so densely packed that between any two 
points lie infinitely many more. 

Bizarre as g seems, it is not hard to show that g is continuous at a = 0 but 
discontinuous elsewhere. To show continuity at a = 0, we need to check that 
lim,_.9 g(x) = g(0) = 0. This follows from the squeezing inequality 


—|2| < g(x) < la]. 


Since the left- and right-hand quantities clearly tend to zero, so must g(x). 

For a # 0, on the other hand, lim,_,, g(a) does not exist. One way to prove 
this is to consider two sequences {r,} and {p,}, each converging to a, with 
rn € Qand p, ¢ Q for all n. Then we have 
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Gn) = a while g(pn) =0 — 0; 
thus, lim... g(a) does not exist, and so g is discontinuous. © 


Continuity at domain endpoints: use one-sided limits. The domain of f(x) = 
/z includes the point 0, but nothing to the Jeft of 0, and so the basic defini- 
tion can’t apply. Still, it seems reasonable to call f continuous at 0, because 
f(x) = Vz & f(0) = 0 for all positive x near 0. The following definition offers 
a natural way out of this mild impasse. 


Definition 3.8 (Continuity at a left endpoint). Suppose that f is defined on an in- 
terval I = [a, b), but not on any open interval containing a. We say f is continu- 
ous at x = aif, for every € > 0, there exists a number 6 > 0 such that 


| f(x) — f(a)|<¢« whenevera @ land a<a<a+td. 
Equivalently, f is continuous at x = aif lim,_.,+ f(x) = f(a). 


The idea for continuity at a right endpoint is essentially the same. In practice, 
endpoint continuity comes up less often than the standard version—after all, an 
interval has only two endpoints, but infinitely many points in between. 


EXAMPLE 3. Check that the function f(x) = ,/z is continuous at x = 0. 


SOLUTION. We only need to show that lim,_.9+ Jr = J/0 = 0. Doing so is 
straightforward. For any € > 0, the choice 6 = €? works: 


if O<2<6, thn 0<\V2r< Vo=e, 
as desired. ©) 


Continuity on a set. As Examples | and 2 illustrate, a function may be con- 
tinuous at some points of its domain, and discontinuous elsewhere. Here is the 
pertinent definition: 


Definition 3.9 (Continuity on a set). A function f is continuous ona set S if f is 
continuous at each point a € S. 


We showed in Example 2, for instance, that the strange function g is contin- 
uous on the set {0}, but not on any larger set. The sign function, by contrast, is 
continuous on the complementary set R \ {0}, and the function f in Example 1 
is continuous on R \ {—5,5}. The function f(x) = ./z in Example 3 is continu- 
ous on [0, 00). It is possible to concoct terribly-behaved functions, continuous on 
small or strange sets, but continuity is the rule for the familiar functions studied 
in calculus. 


We leave it to you. 
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New Continuous Functions from Old 


As we’ve already done with limits, sequences, and other objects, we can combine 
“old” functions in various ways to produce new ones. We will show, next, that 
combining continuous ingredients usually produces continuous results. 


Elementary functions. The standard functions of calculus have formulas, such as 


5 . _ (COSZ cos x 
be+sinz), s ( ) | and arctan (1 
exp(a* + 5a+sinz), sin and arctan ( n (= zs :)) 


Such functions—even the nasty ones—are called elementary, because they’re 
built up, using composition and algebra, from simpler basic “elements,” like x, 
sin x, and In z. 

Elementary functions come in families. Here are several: 


* polynomials: functions of the form p(x) = ap +a,2 + a227 +++-+an2”, 
where n is any positive integer and the a; are constants; 


® rational functions: functions of the form r(a) = p(a)/q(«x), where p and 
q are polynomials; 


® exponential and trigonometric functions: exp x, Sin Z, cos x, tan x, sec x, 
csc @, cot x. 


® inverse functions: logs, inverse trigonometric functions, roots, etc., such as 
Ing, arctan 2, /2; 


* hybrids: functions formed from those above by composition and algebraic 
combinations. 


Are elementary functions continuous? The short answer is yes—but only at 
points a where their formulas make good sense. (If a happens to be a domain 
endpoint, then continuity is understood in the sense of Definition 3.8.) In other 
words, an elementary function is continuous throughout its natural domain. This 
domain condition is often clear at a glance. The middle function above, for in- 
stance, is obviously undefined at x = 0, so there’s no hope of continuity there. 
The right-hand function, by contrast, has an uglier natural domain—both the arc- 
tangent and the log might cause trouble. 

Showing continuity of elementary functions, even ugly ones, on their natural 
domains is easy once we know that the basic functions (x?, 1 /ax, sina, e*, Ina, 
etc.) are all continuous on their domains. We address this problem briefly below— 
but only briefly to keep focus on our main concerns. 


Proposition 3.10. Each of the functions 
c(a) = k(any constant), i(z) =a, sina, arctanz, e”, Ing 


is continuous on its natural domain. 
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Continuity of c and 7 at any input a amounts to nothing more than that limz_._ c = 
cand lim;_,, 2 = a; both claims are very easy to show. Proving continuity of 
the remaining functions rigorously takes more effort—starting with clear defini- 
tions of these functions. We omit details here, but see the exercises for further 
discussion. 


Continuous combinations. Now that we have some continuous functions to com- 
bine, let’s combine some, both algebraically and with composition. 


Proposition 3.11 (Algebraic combinations are continuous). Suppose that func- 
tions f and g are continuous at a point a. Then the functions 


fo, fo, and * 
g 


are all continuous at a (for f /g we require g(a) # 0). 


Proposition 3.12 (Composites are continuous). Suppose g is continuous at a 
and f is continuous at b = g(a). Then the composite function f o g(x) = f(g(x)) 
is continuous at x = a. 


Proof (s): Proving Proposition 3.11 boils down to verifying algebraic properties 
of limits. We’ve done that already; see Theorem 3.3 (page 138) and its proof. 


We’ll prove Proposition 3.12 in good e—d style. For simplicity we’ll assume The proof involves two 6's, 
that f and g are defined on open intervals about b and a, respectively. To this end, chosen one after the other. 
let € > 0 be given. Since f is continuous at b, there exists 6; > 0 such that In other words, a and b are not 


domain endpoints. 


f(y) — f(b)| <e whenever |y— | < 61. 
Similarly, since g is continuous at a and 6, > 0 there exists a 62 > 0 so that 
\g(x) — b| < 61 whenever |x — al < do. 
To finish the proof, note that this 62 works for the original € > 0: 


jz — al <2 => |g(x) — g(a)| < 61 
= I|f(g(2)) - f(gla))l <6 


as desired. We will explore another proof, 
using sequences, in the 


Now we can put the pieces together. SACICISS: 


Proposition 3.13 (Elementary functions are continuous). The elementary func- 
tions are continuous at all points of their natural domains. 
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About proofs. The general point is that elementary functions are combinations, 
of the types treated in Propositions 3.11 and 3.12, of the continuous functions 
discussed in Proposition 3.10. For instance, writing the polynomial p(x) = 327+ 
mx — 7 in the form 


3027 +nrx —-7T=3-a-r+n7-2-7 


shows explicitly that p is built by multiplication and addition from constant func- 
tions and the function i(z), both of which are continuous according to Proposi- 
tion 3.10. 

Continuity of all six basic trigonometric functions on their domains follows 
from continuity of the sine function and such identities as 


F sin & 
cosx =sin(e«+7/2); tanz= > secr= 
cos & 


In a similar way, the functions 


2 = 
r(x) = chee ie and g(x) = arctan (1m ( ee )) 


x2 —1 


are built up from functions shown (or claimed) to be continuous in Proposi- 
tion 3.10. A full proof for all cases would require rigorous definitions and proofs 
of continuity for logarithmic, inverse trigonometric, and other such functions. 


Exercises 
1. Let A, B, and c be any real numbers. Use the ¢«—d definition to show that 
f(a) = Ax + B is continuous at c. Does it matter if A = 0? 


2. Suppose that f is continuous on R and that f(x) = 0 if z is rational. Show 
that f(x) = 0 for all x. 


3. (a) Show that f(x) = |x| is continuous at every a € R. 
(b) Show that if g: J — Ris continuous on J, then so is h(x) = |g(x)|. 
4. If f : I — Randg: I — R are any functions defined on J, then 
we can form new functions max{f,g} and min{ f,g} in the “obvious” 


way: max{f,g}(a~) = max{f(x),g(x)} for all x € J, and similarly for 
min{ f,g}- 


(a) Sketch graphs of max{ f, g} and min{ f, g} if f(a) = sin x and g(x) = 
cos x on I = [—27, 27]. 
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(b) Show that 


max{ f(x), g(x)} = —————— 


Find a similar fact “formula” for min{ f(a), g(a)}. 


(c) Show that if f and g are continuous on J, then so are max{ f, g} and 
min{ f, g}. (Hint: Use (b) and the preceding problem.) 


5. We said in Example 2, page 148, that the sign function is discontinuous at 
a = 0 but continuous everywhere else. Prove this in two parts: 


(a) lim,—o sign(x) does not exist; 
(b) lim, sign(a) = sign(a) if a £ 0. 


6. Suppose f(3) = 5 and f is continuous at 3. Show that there exists 6 > 0 
such that 4.999 < f(x) < 5.001 for x € (3 —6,3+). 


7. Suppose f(3) > 5 and f is continuous at 3. Show that there exists 6 > 0 
such that f(x) > 5 fora € (3 — 6,346). 


8. Let f be the function whose graph consists of the two line segments joining 
(0,0), (1, 1), and (2, —1). Show that f is continuous at z = Oandat x = 1. 
(In fact, f is continuous on all of [0, 2].) 


9. A function f has a removable discontinuity at a point x = a if (i) f is 
discontinuous at a; but (ii) we can define (or redefine) f(a) so that f be- 
comes continuous at x = a. In each part, decide whether the function has a 
removable discontinuity at the given point. If so, explain how to remove it. 
If not, why not? 


4 I —, 
(c) f(x) =xsin=3a=0. 


10. Let J be an open interval containing zero, and f : J — R any function that 
is bounded on J. Define a new function g : I — Rby g(x) = zf (2). 


(a) Show that g is continuous at x = 0. 


(b) Suppose a # 0. Show that g is continuous at x = aif and only if f is 
continuous at « = a. 
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We made up the fancy name. 
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3.3 Why Continuity Matters: Value Theorems 


In the preceding section we defined continuity of functions, and investigated many 
examples and a few non-examples. In this section we explore some good-behavior 
properties of continuous functions. Most notable are two famous theorems on 
“intermediate” and “extreme” values of continuous functions. 


Questions of values. Continuity is a form of good behavior, so it’s natural to 
wonder what other pleasant properties continuous functions might have. Some 
such properties concern output values of a function f. We might reasonably hope, 
for instance, that if f(1) = —7 and f(2) = 42, then for any number v between 
—7 and 42 (the value v is “intermediate” between f(1) and f(2)) there should 
be some input c between 1 and 2 with f(c) = v. The famous intermediate value 
theorem (Theorem 3.16 in this section) guarantees that this happy outcome does 
indeed occur, provided that f is continuous on the entire interval [1,2]. In short, a 
continuous function f “assumes all intermediate values” between f(1) and f(2). 
Put graphically, the claim sounds reasonable: If the graph of f starts at height —7 
and ends at height 42, then it must pass through all “intermediate” heights along 
the way. A rigorous proof is another matter. 

Another value-related question concerns extremes: Must the function f above 
achieve smallest and largest values somewhere on the interval [1, 2]? If f happens 
to be increasing, there is no question: f(1) = —7 and f(2) = 42 are clearly 
the desired “extreme values.” But arbitrary continuous functions can be much 
choppier; must every one achieve maximum and minimum values? The famous 
extreme value theorem (Theorem 3.17 in this section) says yes—again provided 
that f is continuous on the entire interval [1, 2]. 


Properties of Continuous Functions 


Our proofs of the big “value theorems” depend on some basic properties of contin- 
uous functions. These properties are of independent interest, offering new insight 
into what continuity means. 


Sticky inequalities. Our first such property says, roughly, that if a continuous 
function satisfies a strict inequality at a point, then the same inequality holds 
(“persists”) for inputs near that point. 


Proposition 3.14 (The Principle of Persistent Inequalities (PoPI)). Let f be con- 
tinuous at c and defined on an interval I containing c; let IX be any constant. If 
f(©) < K then there is some 6 > 0 such that 


f(a) <K forall x«€(c—6,c+9). 


(A similar PoPI holds if f(c) > K.) 
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Figure 3.4. The function h(a) = sin(1/a) /x; badly unbounded near x = 0. 


Proof: For simplicity, we handle the case where c is not an endpoint of J. The 
case where c is an endpoint is similar. 

Supposing, then, that f(c) < K, we sete = K — f(c) > 0, and choose 6 > 0 
according to the definition of continuity of f at c. This 6 satisfies our present 
claim, because if |a — c| < 6, then 


If(z) — (| <e=K— fic). 
In particular, we have 


f(x)- fl) < K— fle) = fla) <Kk, 


as desired. 


Boundedness. What, if anything, does continuity have to do with boundedness 
of a function f on an interval J? It is clear from easy examples that a continuous 
function f need not be bounded on J, even if J itself is bounded. For instance, the 
functions 


fe) =4, o(2)=++—4, and n(n) = BC) 
x zc «-1 x 
are all continuous on J = (0,1), but f is unbounded above on J, g is unbounded 
above and below, and h behaves even worse. As Figure 3.4 suggests, h is un- 
bounded both above and below on every interval (0,¢€). But there’s good news 
about continuous functions on closed intervals, and the proof involves an old 
friend. 


Proposition 3.15. Let f be continuous on the closed and bounded interval |a, 0). 
Then f is bounded on |a, 6); that is, there exist numbers m and M such that 


m< f(%) <M forall € {a, bj. 
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Below by a and above by b. 


a€ &, for instance. 


By 6, for instance. 


That’s Proposition 3.14, 
page 154. 
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Proof: We will show that any function f with the given properties is bounded 
above; the proof for lower bounds is similar. Suppose, toward contradiction, that 
f is unbounded above. Because one is not an upper bound there must be some 
x1 € [a,b] with f(z,) > 1. Because two is also not an upper bound, there is 
x2 € [a,b] with f(x2) > 2. Continuing this process, we can construct a sequence 


1,02, %3,... with f(a,) >n_ foralln. 


Now the sequence {z,,} is bounded and so, by the Bolzano—Weierstrass theorem, 
has a subsequence {1;,,,, } that converges to some limit xo € [a, 6]. 

Here comes our contradiction. Because f is continuous at xp and xp, — Xo, 
we must have f (an, ) — f(xo), too. This is impossible: for all k we have 


f (fn,) > Me > k. 


This means that the sequence { f (x, )} is unbounded, and hence divergent. 


Proposition 3.15 applies, by the way, to the three functions f, g, and h discussed 
above—as long as we restrict domains to closed intervals, such as [0.002, 0.9998], 
on which all three functions are continuous. (On this interval, —500 < h(x) < 
500, for instance.) 


Two Big Theorems 


With Propositions 3.14 and 3.15 at the ready we can prove our two main theorems. 


Theorem 3.16 (Intermediate value theorem (IVT)). Let f be continuous on |a, b], 
with f(a) # f(b); let v be any number between f(a) and f(b) (i.e, f(a) <u < 
f(b) or f(b) <u < f(a)). Then there exists c in (a,b) with f(c) = v. 


Proof: We discuss the case f(a) < v < f(b); the argument for the case f(b) < 
vu < f(a) is similar. 

Consider the set S = {x € [a,}] | f(x) < v}. Because S is nonempty and 
bounded above, it has a least upper bound—say, c—somewhere in [a, b]. We’ll 
show that c has all the properties claimed in the theorem. 

Our first claim is that f(c) < v. This is trivial if c € S. If c ¢ S, then there is 
a sequence {s,,} of members of S with s,, — c. Because f is continuous at c we 
must have f(s,) > f(c), and since f(s,) < v for all n it follows that f(c) < v. 
We see, too, that c 4 b, since we know f(b) > v. 

To complete the proof, we show that f(c) < v is impossible; the PoPI is the 
key. Indeed, if f(c) < v holds, then the PoPI says that f(x) < v must also hold 
for x in some small interval (c — 6,c + 6). This is absurd—we chose c so that 
f(a) > v for all x > c. Thus f(c) = v, as desired. 


3.3. Why Continuity Matters: Value Theorems 


Theorem 3.17 (Extreme value theorem (EVT)). Jf f is continuous on |a, |, then 
f assumes both a maximum and a minimum value on [a,b]. That is, there exist 
Lmin ANd Lax in |a, b] such that 


f (Lmin) < f(x) Ss f (Ginake) for allx € la, b}. 
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Notice especially the important words “assumes,” “maximum,” and “minimum”: 
The theorem guarantees that f achieves, not just approaches, biggest and smallest 
values on [a, b}. 


Proof: Consider the output set 
R=range(f) = {f(x) | x € [a, b]}. 


Proposition 3.15 says that R is a bounded set of real numbers. Since R is also 
nonempty, the completeness axiom guarantees the existence of an infimum a and 
a supremum (3. To finish the proof, we need only show that a and (@ are members 
of the range. With help—again—from Bolzano and Weierstrass we’ll handle the 
case for (3. 

Recall first that, since G = sup(R), there is a sequence {y,,} contained in 
R with y, — G. Uf @ € R, there is no harm done—we can just take y, = ( 
for all n.) Now for each y,, there is at least one x, in [a,b] with f(@n) = Yn. 
Choosing any one of these for each n produces a new sequence {,,} such that, 
for all n, 

aA<a& <b and f(an) = Yn. 

The sequence {x,,} itself need not converge, but the Bolzano—Weierstrass theo- 
rem guarantees that some subsequence {x,,, } converges, to a limit we'll call @max; 
clearly, %max € [a,b]. Since f is continuous at @max and tp, — Lmax, We Must 
have 


f (max) = jim, f (@n,.) = jim, Ynk = dim, Yn = 2B, 


as claimed. 


Bad values. Both the IVT and the EVT may fail, of course, if important hy- 
potheses are violated. For example, the function f(a) = 1/2 is continuous on the 
open interval J = (0,1), but assumes neither a maximum nor a minimum value 
(for different reasons) on J. And the sign function, defined but discontinuous 
on the closed interval [—10, 10], assumes only one intermediate value between 
f(—10) = —1 and f(10) = 1. (On the other hand, the sign function does assume 
maximum and minimum values.) 


Using the Value Theorems 


The intermediate and extreme value theorems (IVT and EVT) are essential and 
versatile tools for studying continuous functions. We illustrate with several brief 
examples. 


We defined “range” in 
Section 1.4. 


The case for a is almost 
identical. 


See Example 2, page 148. 


Which one? 
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Don’t confuse the sine and sign 
functions. 


The range might be an infinite 
interval, like R. 


“Root” seems less ambiguous; 
we'll use it. 


Plotting p. 


3. Limits and Continuity 


Continuous functions map intervals to intervals. If f : J — R is any function 
whose domain is an interval, then the range of f can be almost any subset of R. 
For a constant function, for example, the range is just one point, while the sine 
function has range [—1, 1], and the range of the sign function is the three-point set 
{—1,0, 1}. For continuous functions, the picture is simpler. 


Proposition 3.18. If I is an interval and f : I — R is continuous, then the range 
of f is either a single point (if f is constant) or an interval. 


Proof: Let J denote the range. Recall what it means to be an interval: if u € J 
and w € J (we might as well assume u < w) and v is any number such that 
u< vu < w, then we need to show that v € J, too. This follows immediately 
from the IVT. We know that u = f(a) and w = f(b) for some a and bin J, that f 
is continuous on [a, 6], and that v is “intermediate” between u and w, and so the 
IVT assures that f(c) = v for some c between a and b, as we needed to show. 


Odd-degree polynomials have real roots. A number r is a root (or a zero) of a 
function f if f(r) = 0. Thus, p(x) = 2? — 1 has roots +1, q(x) = 2? + 1 has no 
real roots, and 


s(x) = 2° — 15a* + 85a? — 22527 + 274a — 120 
= (a — 1)(a — 2)(x — 3)(a — 4) (a — 5) 


has roots 1, 2, 3, 4, and 5. Deciding whether an arbitrary function has any roots, let 
alone finding them, can be challenging, but the IVT offers some encouragement 
for a large class of polynomials. 


Proposition 3.19. Every polynomial function 
p(x) = Ox” + noes ++ +412 + ag 
of odd degree (a, 4 0 and n is odd) has at least one real root. 


About the proof. Every polynomial function is continuous everywhere. If we can 
find any a and b with p(a) < 0 and p(b) > 0, then the IVT will guarantee that 
p(r) = 0 for some root r between a and b. Finding such a and b for an odd-degree 
polynomial p() is indeed possible, because 


lim p(x) =-+oo, while lim p(x) = Foo, 


TOO 


and so p(a) must assume both positive and negative values. If, say, p(x) = 2° — 


7x? + 5a + 3, then p(0) = 3 while p(—1) = —13, so p must have a root in the 
interval (—1,0). We leave a more formal proof to the exercises. 


3.3. Why Continuity Matters: Value Theorems 


Fixed points. A number a is a fixed point of a function f if f(a) = a. (Equiva- 
lently, a is a root of the new function f(a) — x.) As with roots, the IVT can help 
assure that fixed points exist. 


EXAMPLE 1. Show that if f : [0,1] — [0, 1] is continuous on [0, 1], then f has 
a fixed point. 


SOLUTION. Note a crucial hypothesis about the codomain: 0 < f(x) < 1 for 
all x € [0,1]. This means, geometrically, that the graph of f stays inside the 
square window [0,1] x [0,1] in the xy-plane. Our problem amounts to showing 
that this graph touches or crosses the line y = x at least once. This may seem 
obvious, but we want proof, and the IVT will help. 

The trick is to look at the new function g(x) = f(x) — a; note that g is also 
continuous on [0, 1]. Note also that 


(@) g(0)=f(0)>0; Gi) gM) = f(1)-1 <0. 


If either g(0) = 0 or g(1) = 0, we’re done already, so we may as well assume 
that both (i) and (ii) are strict inequalities. In this case, the IVT guarantees that 
g(x) = 0 holds for some x € (0,1), as we aimed to show. © 


Maximum-minimum problems. Many hoary old problems of elementary calcu- 
lus—enclose as much area as possible in a rectangular pigpen, one side up against 
the barn, with 100 yards of fence—depend implicitly on the EVT. The functions 
to be maximized or minimized are typically continuous and defined on closed in- 
tervals, and so the EVT guarantees that the desired maximum or minimum exists. 
Sometimes the functions and intervals in question are lightly disguised. 


EXAMPLE 2. Suppose a and b are nonnegative numbers with a+ b = 3. Is there 
a maximum possible value of ab?? Of b/a? If so, what are these values? How is 
the EVT involved? 


SOLUTION. If we write b = 3 — a, then it is natural to consider the functions 


b 3-a 3 
= ab? = _ ie d Se =--l, 
p(a) = ab? = (3a)? and g(a) = 2 =3—8 = 3-4, 


our question is whether these functions achieve maximum values for a € [0, 3]. It 
is clear at a glance that g(a) — oo as a — 0°, so q is not continuous on [0,3], 
and all EVT bets are off. The polynomial function p, by contrast, is continuous 
on [0,3], and so the EVT guarantees that a maximum exists—but gives no clue 
how to find it. With calculus methods (taboo for now, as we haven’t gotten to 
derivatives) it is easy to show that the maximum value, four, is attained when 
o= 4. % 


See it in the notation? 


Sketch for yourself. 


At the formula or at a graph. 


See the exercises for a 
non-calculus strategy. 
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Using the IVT. 


If necessary, we can always 
take € smaller. 


3. Limits and Continuity 


Continuous inverses. Important calculus functions often come in “inverse pairs,” 
like 


f(z)=e” and g(x) =Inga; 
f(z) =a? and g(x) = Va; 


f(z) =tanz and g(x) = arctang. 


With due care taken for domains, the inverse relationship means that f(a) = 6 if 
and only if g(b) = a. For the last pair, for instance, we have f(1.2) = tan 1.2 = 
2.57215, while g(2.57215) = arctan(2.57215) = 1.2. 

It is natural to hope in this situation that if either of f and g is continuous, 
then so is its “partner”’ (Knowing this might be practically useful—sometimes 
it is easier to show directly that one rather than the other is continuous.) The 
following proposition addresses this situation. 


Proposition 3.20. Let f : I — Jandg: J — I be inverse functions, where I 
and J are open intervals. If f is continuous on I, then g is continuous on J. 


Proof: Since f has an inverse, it must be one-to-one on J, and this leads to a 
useful claim: 


If f is one-to-one and continuous on an interval I, then f is strictly 
monotone on I. 


The claim should seem plausible; we'll leave its proof as an exercise. Assuming 
the claim, we will handle the case where f is strictly increasing; the case for 
decreasing f is similar. 

Let b € J be given. Then g(b) = a for some a € J; note that also f(a) = b. 
To see that g is continuous at b, let € > 0 be given. Since J is an open interval 
and a € I, we may as well assume that the small closed interval [a — €, a + €] is 
contained in J. Because f is increasing, we know that f(a — «) < f(a) =b< 
f(a+). To complete the proof, we now choose any 6 > 0 so that 


f(a-6e) <b-6<b+6< f(ate). 
To see that this 6 works for the original €, consider any y in J with 
f(a-—e) <b-d<y<bt+6< flate). 
Applying the strictly increasing function g to all parts of this inequality, we get 


(a—e) <g(y) <g(b+45)< (ate), 


which is another way of saying that |g(y) — a] < e. 


3.3. Why Continuity Matters: Value Theorems 


Exercises 


1. Consider a function f : R — R anda domain point c € R. We say that f is 
locally bounded at x = cif there is some 6 > 0 and some M > 0 such that 
| f(x)| <M whenever x € (c—6,c+ 4). (In other words, f is bounded on 
some open interval centered at c.) 


(a) Show that f(x) = x? is locally bounded at x = 42. (Find specific 
values of 6 and M; there are many possibilities.) 


(b) Show that if f is continuous at c, then f is locally bounded at c. 


(c) Find a function f : R — R that is locally bounded but not continuous 
at x = 0. Explain why your example works. 


(d) Find a function f : R — R that is not locally bounded at + = 0. 
Explain why your example works. (Be sure your example is defined 
for all inputs x.) 


2. This problem revisits Proposition 3.19, page 158. Throughout, let p(x) = 
x” + anya" 1 +---+ a1," + ag, where n is odd. 


(a) Explain why lim, p(x) = oo. (Similarly, lim, p(#) = 
—oo.) 

(b) Use the definition of a limit at infinity to show that there exists b € R 
with p(b) > 0. (For similar reasons, there is a € R with p(a) < 0.) 


(c) In practice, it is often easiest to find a and b as above by (educated!) 
guessing. (Plotting may also work, but that’s too easy.) Use this ap- 
proach and the IVT to show that g(x) = 2° —x*—a? —x2?+a+4+1 has 
at least three roots. Can you also find the roots exactly, by algebra? 


3. Find examples of polynomials as prescribed. No formal proofs needed, but 
describe your answers clearly. 


(a) A polynomial of degree 5 with one root. 
(b) A polynomial of degree 4 with 3 roots. 
(c) A polynomial of degree 42 with no roots. 


(d) A polynomial of degree 42 with 41 roots. 


4. A function f : R — R achieves a maximum value at 2max if f (@max) < 
f(a) for all x € R, and similarly for a minimum value at &min. For instance, 
f(a) = cos x achieves its maximum value at x = 0 and its minimum value 


atv =T7. 
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(a) What, if anything, does the EVT guarantee about maximum and min- 
imum values of a continuous function f : R — R defined on all of R? 


(b) No odd-degree polynomial achieves a maximum or a minimum value 
on R. Explain why. (See an earlier problem.) 


(c) Every even-degree polynomial g(a) achieves either a maximum or a 
minimum value on R. Explain why. (Hint: This is just a little harder. 
Note that either (i) g(x) — oo as x — +00, or (ii) g(x) — —oo 
as x — -+oo. Show that q achieves a minimum in case (1) and a 
maximum in case (ii).) 


(d) Can an even-degree polynomial g(x) achieve both a maximum and a 
minimum value on R? Explain. 


5. This problem is about the functions p(a) = a(3 — a)? and g(a) = 2-1 
discussed in Example 2, page 159. 


(a) Does p assume a minimum on [0,3]? If so, what is it? What does the 
EVT say? 


(b) Does g assume a minimum on (0, 3]? If so, what is it? What does the 
EVT say? 


(c) We said, but didn’t prove, in Example 2 that the maximum value of p 
on [0, 3] is p(1) = 4. Prove it now, in two steps: 
(i) Factor p(a) — 4 (one factor is a — 1). 
(ii) Use your factorization to explain why p(a) — 4 < 0 whena € 
(0, 3]. 


6. Let P be an odd-degree polynomial. Show that P(x) = 7 for at least one 
real input x. (Hint: See Proposition 3.19, page 158.) 


7. Does f(x) = x? — 2x achieve maximum and/or minimum values some- 
where in (—1, 2)? What about a supremum or infimum? 


8. In each part following, either give an example or explain why none can 
exist. 


(a) A function f : (0,1) — R, continuous on (0,1), that achieves a 
maximum but not a minimum on (0, 1). 


(b) A function g : [0,1] — R, continuous on [0, 1], whose range contains 
m and e but not 3. 


(c) A polynomial function p : R — R, with range [—3, co). 


(d) A polynomial function q : R > R, with range (—3, 00). 


(e) A function r : R > R, continuous on R, with range (—3, 00). 


3.4. Uniform Continuity 


9. 


10. 


11. 


(a) Find a continuous function f : (0,1) — R whose range is [0, 1). 


(b) Find a continuous function f : (0,1) — R whose range is [0, 1]. 


(c) Is there a continuous function f : [0,1] — R whose range is (0, 1])? 
Why or why not? 


Let f : R — Rbe acontinuous function such that f(a) is a rational number 
for every real number input x. Show that f must be a constant function. 


The Principle of Persistent Inequalities (Proposition 3.14, page 154), de- 
scribes a nice property of continuous functions. Here we investigate two 
“one-sided” versions of the PoPI. 


A function f satisfies the PoPILTV (the “less-than” version of the PoPI) at 
«x = cif, for every number K with f(c) < K, there exists 6 > 0 such that 
f(a) < K whenever x € (c — 6,c + 6). The PoPIGTV (the “greater-than” 
version) is the same, except that the condition f(a) < K is replaced by 
f(a) >K. 


(a) Suppose f is continuous at x = c. The ordinary PoPI says that f 
satisfies the PoPILTV. Show that f also satisfies the PoPIGTV. 

(b) Suppose g(a) = 1 if x > 0 and g(x) = 0 otherwise. Show that g 
satisfies the PoPIGTV but not the PoPILTV at x = 0. 

(c) Suppose h(z) = lif « = 0 and h(x) = 0 otherwise. Which of the 
PoPIGTV and the PoPILTV does h satisfy at x = 0? 


(d) Show that a function f : R — R is continuous at x = c if and only if 
f satisfies both the PoPIGTV and the PoPILTV at x = c. 


. Let f : [a,b] — R be continuous on [a, b]. Show that if f attains either its 


maximum or its minimum value at an interior point c, then f is not one-to- 
one. 


. Let I be an interval (open or closed) and suppose that f : J — R is contin- 


uous and one-to-one. Show that f is strictly monotone. 


3.4 Uniform Continuity 


Here’s a definition with many familiar ingredients—but a new adjective: 


Definition 3.21 (Uniform continuity). Let J C R be an interval and f a function 
defined on S. We say f is uniformly continuous on I if for each € > 0 there exists 
5 > 0 such that 


\f(x) — f(y)| <¢ wheneverz,y¢Jand |r—y| <0. 
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We showed long ago that this is 
possible. 


See Figure 3.1, page 135. 


3. Limits and Continuity 


What’s new here? What’s wrong with ordinary continuity, over which we’ve 
worked hard? We’1l address these good questions right after some basic examples. 


EXAMPLE 1. Both f(x) = 32 +5 and g(x) = 2? are continuous on J = R. 
Are they also uniformly continuous on R? Why or why not? 


SOLUTION. The function f is uniformly continuous. For any inputs x and y, we 
have 


[f(2) — Fy) = [8a +5 — 38y — 5] = 3|x— yl. 
Thus, for any € > 0, the value 6 = €/3 does what Definition 3.21 asks: 
€ 


|f(x) — f(y)| <e wheneverz,y¢ Rand |x—y|<d6= 3 


The function g is not uniformly continuous. Showing this takes a little doing, 
but it is worthwhile to illustrate key ideas. We’ll do it by setting « = 1 and 
showing that no positive 6 works. Indeed, let any fixed 6 > 0 be given. First 
we choose an integer n with 1/n < 6, and set x = n+1/nand y = n. Then 
|a — y| =1/n < 6, but 


r\° 1 
j(e) a) =| = |= (m4 2) —nP=24 5 >2>e 
n n 
Thus, our attempted 6 failed, and so would any other. ©) 


Uniform vs. ordinary continuity. As Example 1 suggests, uniform continuity is 
a strictly stronger property than ordinary continuity. A function that is uniformly 
continuous on J is also continuous on J in the ordinary sense, but the converse 
is false: A function (like g(x) = x? on R) may be continuous but not uniformly 
continuous on J. Here are two ways to view the difference: 


© One 6-size fits all: Ordinary continuity of f on J means that, having first 
specified a € I and € > 0, we can then choose 6 > 0 so that | f(a) — f(a)| 
< e€ whenever x € I and |x — a| < 0. In particular, 6 may depend both on 
€ and on a. 


In uniform continuity, by contrast, 6 may depend on € but not on a. In this 
case, | f(x) — f(y)| < © holds for any pair of inputs x and y in I. For 
given ¢, in other words, a single choice of 6 works “uniformly” across the 
interval I. 


© Window dimensions: A function f is continuous at a if limy., f(x) = 
f(a). In Section 3.1 we described this condition graphically: For any given 
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(a) A given €, a good 6 (b) Same e, bad 6 


Figure 3.5. Why f(a) = 2? is not uniformly continuous on R. 


€ > 0 we can choose 6 > 0 so that the graph of f stays inside the rectangu- 
lar “window” 


[a —6,a+ 06] x [f(a) —«, f(a) + €] 


all the way from left to right, never escaping out the top or bottom. (This 
window has dimensions 26 x 2c.) A function f : I — R is continuous 
on I if, at every point of the graph, such a window can be found for every 
“half-height” €, no matter how small. 


To say that f : J — R is uniformly continuous means, in this language, that 
for given « > 0 we can choose some 6 > 0 so that a 26 x 2e€ window works 
at every point of the graph. This perspective suggests what went wrong 
with g(x) = «x? in Example 1: as g rises more and more steeply over its 
domain, its graph tends more and more to escape from the top or bottom of 
a given window. Figure 3.5 gives some idea of the problem. 


A function may be uniformly continuous on one set but not on another. 


EXAMPLE 2. The function f(x) = x? is not uniformly continuous on R. Is f 
uniformly continuous on J = [—3, 42]? Is f uniformly continuous on (—2, 17)? 


SOLUTION. Yes, and yes—and the second yes follows immediately from the 
first. If any f is uniformly continuous on a set S, then f is automatically uni- 
formly continuous on any smaller set S’ CS. 

Informally speaking, the first “yes” applies because f is “steeper” at x = 42 
than anywhere else on J = [—3, 42]. Therefore, any 6 that “works” for a given € 
at x = 42 will also work elsewhere in J, where the graph is less steep. 
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Before starting the formal proof, note that for any x and y we have 
f(z) — F@)| = |2* —y?| = |2- yl le +91. 
Now consider the last factor, |a + y|: For x and y in [—3, 42] I we have 
lz + y| < |x| + |y| < 42+ 42 = 84. 


Putting the pieces together produces a key inequality: | f(a) — f(y)| < 84 |x — y| 
for all x and y in I. This done, we’re ready for our (very short!) formal proof. 

Let « > 0 be given. Set 6 = €/84. This 6 works, since if x,y € J and 
|x — y| < 6, then 


lf(x) — fl =|x@—-ylle+y| < 84|z—y| < 846 =e, 
as desired. ©) 


This section’s main theorem covers several of the preceding examples. 


Theorem 3.22. If f is continuous on a closed and bounded interval |a, b], then f 
is uniformly continuous on [a, }). 


Proof: Let « > 0 be given, and assume, toward contradiction, that no 6 > 0 
works in the sense of Definition 3.21. Then 6 = 1/n must fail for every positive 
integer n. This means that, for each n, there must exist x, and y,, in [a, b] such 
that 


ln —anl<— but [f(en) — Fn)I 2 


The two sequences {x,,} and {y,,} so constructed are obviously bounded; apply- 
ing the Bolzano—Weierstrass theorem to {x,,} produces a subsequence {,,, } that 
converges to some limit a9 € [a, b]. Now we extract the same subsequence {¥n, } 
from {y,,}; because 


1 
lZn, — Yng| < Nk <r 


for all k, we must have both x, — x and Yn, — Xo. 
Now a contradiction looms. Since f is continuous at 7, we must have both 


f (@nx) > F(to) and f Yn.) > f(#o). 


This is incompatible with our original arrangement that 


If (@n.) — f Yn) > € 


for all k, and so the proof is done. 


3.4. Uniform Continuity 


Staying close. If uniform continuity seems forbiddingly technical or obscure, 
that’s because we worked our way up to it gradually, via ordinary continuity at 
points and on sets. In another sense, however, uniform continuity of a function f 
on a set S'is the cleanest and simplest version possible: 


f(a) and f(y) are within « whenever x and y are within 0. 
We will exploit this view later, when we study integrals. This basic idea—that 
f “preserves closeness”—translates naturally to Cauchy sequences, which are all 


about closeness. 


Proposition 3.23. If f is uniformly continuous on I and {xy} C I is a Cauchy 
sequence, then { f(x»,)} is a Cauchy sequence, too. 


Proof: Let €« > 0 be given. By hypothesis there is some 6 > 0 that works in the 
sense of uniform continuity of f on I. Because {x,,} is a Cauchy sequence, we 
can choose WN so that 

|in —Lm| <6 whenevern >m>N. 


Because %, 2m € J for all n, our choice of 6 implies that 


|f(an) — f(am)| <¢€ whenevern >m> JN, 


as required. 


EXAMPLE 3. Let f(x) = 1/2. Use Proposition 3.23 to show that f is not 
uniformly continuous on J = (0,00). Is f uniformly continuous on [1, 00)? 


SOLUTION. We could use the definition to show directly that f is not uniformly 
continuous (see the exercises), but using Proposition 3.23 is shorter. We can sim- 
ply observe that the sequence {1/n} is contained in I and Cauchy, while the 
output sequence { f(2,)} = {n} is surely not Cauchy. 

On the interval [1,0o), by contrast, f is uniformly continuous. To see this, 
note that for x, y € [1,00) we have 


This implies that, for any « > 0, the choice 6 = ¢ works in the definition of 
uniform continuity. ©) 


Yes, we mean 6, not e. 
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3. Limits and Continuity 


Staying bounded. Let f : J — R be continuous on a bounded interval J. If 
I = [a,b] is closed, then the EVT guarantees (among other things) that f is 
bounded on I. On an open interval I = (a,b), by contrast, a continuous function 
may well be unbounded—as is f(x) = 1/x on (0,1). But uniformly continuous 
functions behave better, as the following proposition shows. The proof involves 
ingredients that are familiar by now. 


Proposition 3.24. Let I be any bounded interval I, open or closed. If f is uni- 
formly continuous on I, then f is bounded on I. 


Proof: Suppose toward contradiction that f is unbounded above. (A similar ar- 
gument applies if f is unbounded below.) Then we can choose x1, 42, %3,..., all 
in J, such that 


f(x1) > 1, f (x2) > 2, f (x3) > 3, 


By the Bolzano—Weierstrass theorem, some subsequence {2 } is convergent— 
and therefore Cauchy. Now Proposition 3.23 says that the image subsequence 
{f (n,,)} is also Cauchy—and therefore bounded. This is impossible, because 
we chose the x,, such that f (an,) > nx > k for all k. 


Exercises 


1. Throughout this problem, let f(x) = x? and € = 1. Since f is continuous 
at c = 1, we can find some 6 > 0 such that | f(a) — f(1)| < 1 when 
ja-—1] <0. 


(a) Show that 6 = /2 — 1 = 0.4142 “works” in the situation described 
above. 

(b) Show that 6 = V101 — 10 = 0.0499 works for c = 10. (As before, 
f(a) =a? ande = 1.) 

(c) In the spirit of the preceding parts, find values of 6 that work for c = 1, 
c = 10, c = 100, and c = 1000. How do these values reflect the fact 
that f is not uniformly continuous on R? 

(d) Now let g(x) = 10x + 3 and (again) « = 1. As in the preceding part, 
find values of 6 that work for c = 1, c = 10, c = 100, and c = 1000. 
How do these values reflect the fact that f is uniformly continuous on 
R? 

(e) The function f is uniformly continuous on the interval J = [1, 1000]. 
Find a value of 6 that works for € = 1 in Definition 3.21. 


2. Let I and J be any intervals, with J C J, and let f be a function. Show that 
if f is uniformly continuous on J, then f is uniformly continuous on J, too. 
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3: 


hs 


8. 


(This is not a trick question—the claim is indeed simple or even obvious. 
The point is to give a proof based on the definition.) 


. Let f(x) = 2°. 


(a) Show directly from the definition that f is uniformly continuous on 
[—3, 42). (Hint: a? — b? = (a — b)(a? + ab + b?).) 


(b) Use Theorem 3.22 and the claim in the preceding problem to show 
(again) that f is uniformly continuous on [—3, 42). 


. Use Definition 3.21 (not theorems) in each part. 


(a) f(x) = 5 is uniformly continuous on R. 


(b) g(x) = 2% + 7 is uniformly continuous on R. 


(c) h(x) = x? + 7 is uniformly continuous on [—100, 42]. 


(d) l(a) = 1/2 is not uniformly continuous on (0, 1). 


Use either Definition 3.21 or appropriate theorems in each part. 


(a) f(x) = x° is not uniformly continuous on R. 

(b) g(x) = \/z is uniformly continuous on [1, 00). 

(c) h(x) = cos(1/2) is not uniformly continuous on (0, 1). 

(d) k(x) = ee 1/2) is uniformly continuous on (0, 1). (Hint: If we set 
)= 


k(0) = 0, then & is continuous on [0, 1].) 


. Show that if f is uniformly continuous on an interval J, then it is also 


continuous in the ordinary sense. 


Assume in all parts that both f and g are uniformly continuous on an inter- 
val I. 


(a) Show that f + g is uniformly continuous on J. 


(b) Give an example to show that fg need not be uniformly continuous 
on I (even though fg is continuous in the ordinary sense). 


(c) Show that if J = [a,b] is closed and bounded, then fg is uniformly 
continuous on I. 


(d) Show that if both f and g are bounded on J, then fg is uniformly 
continuous on J. (This is a little harder.) 


A function f : I — Ris called Lipschitz continuous (after a nineteenth- 
century German mathematician) on I if, for some constant K, the ine- 
quality 

f(z) -f)|<K |z—-y| 
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holds for all x and y in I. The sine function, for instance, is Lipschitz 
continuous (with K = 1) on R, because (as can be shown) 


|sinz — sin y| < |x — y| 


for all x, y € R. Lipschitz, like uniform, continuity is a “stronger” version 
of ordinary continuity. 


(a) Show that if f is Lipschitz continuous on J, then it is also uniformly 
continuous on J. 


(b) Show that f is Lipschitz continuous on I if and only if the ratio 
\(f(x) — f(y))/(a — y)| is bounded for x,y € I, x 4 y. 

(c) We know (how?) that g(x) = \/z is uniformly continuous on [0, 1]. 
Show that g is not Lipschitz continuous on [0, 1]. (Hint: One approach 
is to let x, = 1/n and y, = 2/n for positive integers n, and then 
study the ratio in the preceding part.) 


. Let f : R — R be a function. The condition 


Ve >0 3d>0 suchthat |x—y| <d => |f(x)—- fly)| <e 


defines uniform continuity of f on R. What does each of the following 
conditions say about f on R? Must such an f also be uniformly continuous 
on R? If not, give an example. 


(a) Ve >0 and Vd > 0, |x—y| <6 ==> |f(x)—-fly)|<e 
(b) de > 0 such that VO > 0, |Ja—y| <6 => |f(2)- f(y)| <e 


FQ) 6 
f(y)| <«. 


(c) Ve >0 3d > 1 suchthat |x —y| <6 => |f(x 
(d) Ve > 1 36 > 0 such that |x—y| <6 => |f( 


Sl SS: 


x 


. Claim: Suppose f is uniformly continuous on the half-open interval (0, 1]. 


Then we can “extend f” (i.e., define f(0)) to be uniformly continuous on 
the closed interval [0, 1]. 


Prove this in the following steps: 


(a) Set f(0) = limp. f (1/n). Explain why this makes sense—i.e., 
explain why the limit exists. (Hint: The sequence 1,1/2, 1/3, ... is 
Cauchy; apply Proposition 3.23, page 167.) 

(b) (This is a little harder.) To show that f is uniformly continuous on 
all of [0,1], let e > 0. Set e’ = €/2, and choose 6 > 0 that “works” 
for ¢’ in the definition of uniform continuity of f on (0, 1]. Show 
that this 6 works for € in the definition of uniform continuity of f 
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on [0,1]. (Hint: It is enough to show that | f(x) — f(0)| < ¢ when 
|x — O| < 6. To do this, choose any no such that 0 < 1/no < x 
and | f(1/no) — f(0)| < €/2. (Why is this possible?) Then use the 
triangle inequality.) 


11. Use Definition 3.21, not theorems, to show that f(x) = /a is uniformly 
continuous on [0, 1]. (Hint: Prove first that if x > y > 0, then /x — /y < 


f/@— y.) 


12. Use Definition 3.21, not theorems, to show that f(a) = 1/(x? + 1) is uni- 
formly continuous on R. 


Derivatives 


4.1 Defining the Derivative 


Calculus at last. We’ve worked hard to develop important calculus ingredients: 
functions, limits, continuity, etc. At last we are ready to put them together; we’ll 
treat derivatives in this chapter and integrals in the next. Many derivative and 
integral formulas and calculations are, of course, already familiar from elementary 
calculus. The new twist is to derive such “known” facts rigorously, and to put 
them in theoretical context. 

First comes the key definition: 


Definition 4.1 (Derivative at a point). Let f be a function defined on an open in- 
terval J, and let a be a point in J. We say f is differentiable at a, with derivative 
f'(a), if the limit 

ta 8 x£—-4 


exists. 


Let’s test it on some familiar well- and ill-behaved functions. 


EXAMPLE 1. Find, if possible, derivatives at a = 3 for the functions 
cx) =5, &2)=22+7, g(z)=27, and a(x) =|x—-3]. 


What general facts do the calculations suggest? 


SOLUTION. The limit calculation for c(x) is easy: 


Site EDT ep 


23 2-3 233 7-3 


0. 


Thus, c’(3) = 0. Since the value x = 3 played no important role in the calcula- 
tion, we’d guess (correctly) that c/(a) = 0 for all a. 
Finding @’(3) is easy, too: 
L(x) — 2 -1 22 — 
PG) tay OS) eg PE ae Og 


a3 2-3 xL3 z—3 “2-3 GF 
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See the exercises. 


Watch for a factoring trick. 


Left- and right-hand limits do 
exist at x = 3, but they are 
unequal. 


Sketch these functions to see 


the idea; no technology needed. 


Look what happens along a 
rational sequence tending to 
zero, and see the exercises. 
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Thus, ¢’(3) = 2, and (almost) the same calculation shows that ¢’(a) = 2 for all 
inputs a. 
For qg, we need another limit calculation: 
= 2 
van 12) 98) _ yp? 


9 F = 
233 x—-3 = 2-3 Soe 


q'(3) = 


Thus, q'(3) = 6. This time the calculation looks a bit different away from x = 3, 
so we defer (just briefly) finding q’ (a) for other inputs. 
The derivative a’ (3), if it exists, is the value of 


- = 1 if 
lim gays) = lim jes = lim ; ae 
t3 2-3 23 2 63 1 ifa <3. 
From the last form, we see that a’(3) does not exist. ©) 


The results of Example | won’t surprise any calculus veteran. Let’s see what 
happens with some stranger functions, one of them an old friend. 


EXAMPLE 2. Are the functions 


1 ifeeQ, x2 ifs>O, 
a d = 
F(2) f if2¢Q, “ g(x) {3 if2 <0 


differentiable at x = 0? 


SOLUTION. The short answers are no and yes. The limits in question are 


lim f() and lim g(a) 
z—-0 2 z-0 


The first limit clearly fails to exist, so f is not differentiable at x = 0. For the 
second limit, we just note that 


g(x) f ifx > 0, 


z 0 ifx<0, 
and so (x) 
! ge GAL) 
g(x) = lim ~~ =0. % 


Different Views of the Derivative 


The derivative is defined formally as a limit, but it has several familiar and useful 
interpretations. We review several briefly. 


4.1. Defining the Derivative 


Average and instantaneous rates. The expression 


f(x) = f(a) 


z-a ’ 


known as a difference quotient, can be thought of as the average rate of change of 
f(x) with respect to x over the input interval between a and x. The derivative is 
the limit of such rates as x approaches a, and so can be seen as the instantaneous 
rate of change of f when x = a. 

We won’t dwell now on the connection between derivatives and rates. But we 
should acknowledge that the idea is among the best and most useful ever had—by 
anybody. The power of calculus to describe and predict real-world change, from 
falling apples to celestial mechanics, is hard to overstate. As Isaac Newton and 
others discovered, calculus is the right “language” in which to express physical 
laws and to deduce their consequences. 


Slopes. In elementary calculus, f’(a) is sometimes described simply as the “slope 
of the graph” of f at x = a. To be a bit more precise, note that the difference 


quotient 
f(x) = f(@) 
xa 

is the slope of a certain secant line, which passes through the points (a, f(z)) 
and (a, f(a)) on the graph of f. If the derivative exists, then the limiting value 
f’(a) can also be thought of as a slope, this time of the tangent line to the graph at 
x = a. Observe, too, that slope is itself a rate (of ascent, in this case), and hence 
a special case of the rate-of-change interpretation discussed above. 


Linear approximation. Thinking of derivatives as slopes is simple and helpful, 
especially for well-behaved functions. But subtleties can arise. What is really 
meant, for example, by the slope of a curve at a point? What if the graph of f is 
not a smooth curve at all, but ragged or broken? To skirt such problems, we can 
consider /inear functions, for which no such troubles arise. 


Definition 4.2 (Linear approximation). Let f be a function and a a domain point 
for which f’(a) exists. The linear approximation to f at a (also known as the 
first-order approximation) is the linear function £, defined by 


la(x) = f(a) + f'(a)(@ — a). 
Observe some basic properties of the linear approximation: 
* Simplest examples: If f(x) = x? and a = 2, then 


lo(x) = f(2) + f'2)(a — 2) =4 4 4(e — 2) = 4 4 de. 
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And deservedly popular in 
elementary calculus courses. 
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0.010 


VA 0.005 F 


0.000 F 


0.10 0.05 0.00 0.05 0.10 


(a) Two ordinary tangent lines (b) A stranger tangent line 


Figure 4.1. Two views of linear approximation. 


If f(x) = 2? and a = 0, then 


lo(x) = f(0) + f’(0)(x — 0) =0+ O(@ —0) =0. 


Shared values—up to a point: The functions f and @, “agree to first order” 
atv =a: 


f(a) =fa(x) and f"(a) = (a). 


(The second fact holds because ¢,, being linear, has constant derivative.) 
Away from x = a, anything can happen. In the case of f(a) = «x? and 
I3(x) = 9+6(a#—3), for example, we have f(—42) = 1764 and f’(—42) = 
—84, while ¢3(—42) = —261 and ¢5(—42) = 6. 


Graphical views: The close connections between f and £, are reflected in 
their respective graphs. Indeed, the graph of @, is in a natural sense the 
line that “best fits” the f-graph at « = a. In elementary calculus parlance, 
this line is called the tangent line at x = a. The word “tangent” suggests 
“touching,” and in many cases this line does indeed “touch but not cross” 
the graph of f at the point (a, f(a)), as illustrated in Figure 4.1(a). But in 
other cases the graphs of f and @, are seriously intertwined, as Figure 4.1(b) 
and the following example illustrate. With such possibilities in mind we 
will usually refer to “linear approximation” rather than “tangent lines.” 


EXAMPLE 3. Show that the function f : R — R, shown in Figure 4.1(b) and 
defined by 


_ ja*sin(1/x) ifa £0, 
roy {' ife—G. 


is differentiable at a = 0. What is the linear approximation there? 


4.1. Defining the Derivative 


SOLUTION. The limit in question is 


= 0 2 1 
f’(0) = lim Acer) = lim eset) = lim zsin(1/z), 
xz—0 Lr 0 xz—0 4 od xz—0 
which we’ve already shown (by squeezing; see Example 6, page 139) to be zero. 
Thus f’(0) = 0, and so the corresponding linear approximation—also visible in 


Figure 4.1(b)—“is just 
lo(x) = f(0) + f"(0)(@ — 0) = 0. 


The calculation was easy, but the situation is still undeniably strange: The graph 
of f crosses its tangent line infinitely often in every open interval (—d, 6) around 
a = 0. In the interval (—.001, .001), for instance, we have f (1/(k7)) = 0 for all 
integers k with |ka| > 1000. ?) 


Magnification. A useful geometric alternative to the derivative-as-slope view in- 
volves magnification factors. If f’(a) = m (for magnification), then for x near a 


we have 
f(x) — f(a) 
r-a 


=m 
which implies, in turn, that 


|f(2) — f(a)| © |m| |x — al. 


Interpreting absolute values as distances, we see that the distance between outputs 
f(a) and f(a) is about |m| times the distance between inputs x and a. 

If f(z) = «?, for instance, we have f’(3) = 6, and f maps the small input 
interval (2.95, 3.05) to the output interval (.7025, 9.3025), which is six times as 
long. 


Derivatives as Functions 


For a given function f the derivative f’(a) depends on a. If, say, f(x) = x”, then, 
as we have seen, f’(a) = 2a holds for every input a, and we might simply write 
f'(x) = 2a. It is natural, in other words, to think of f’ as a function in its own 
right, derived in a special way from /f. 

To avoid “symbol-creep” it is convenient to use the same input symbol, often 
x, for both f and f’. Thus we might write something like 


t)-— 
toa t-2 
when we want to think of both f and f’ as functions of x. The difference with 


earlier versions of the derivative limit is entirely notational—no new mathematics 
is involved. 


Hence the term derivative 
function. 
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See the factoring trick? 
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EXAMPLE 4. If f(a) = \/a, what’s f’(x)? How are the domains of f and f’ 
related to each other? 


SOLUTION. Let’s calculate. For x > 0, we have 


1 1 1 


= jj SS ES eo SS 
mo Vit ja Verse 2a 


as we learned back in elementary calculus. 

No derivative f’(0) exists in this case. One problem is that f is not defined 
on any open interval containing 0, as the definition requires. Thus f’ has domain 
(0, 00)—a proper subset of [0, 00), the domain of f. © 


Increase, decrease, and derivatives. As elementary calculus experience sug- 
gests, properties of the derivative function f’ reflect corresponding properties of 
f. Where f’ is positive, for instance, we expect f to be increasing, and where f’ 
takes large positive values, we expect f to be rapidly increasing. These expecta- 
tions will turn out to be well-founded, but giving precise proofs is a little trickier 
than one might expect. We’ll return to such matters later in this chapter. 


Higher Derivatives 


The function f’ may be differentiable in its own right, to produce a new function 
f”, the second derivative of f. Repeating the process produces higher-order 
derivatives f'”, f®, f©, and so on. If f(a) = x?, for example, then we have 


fie) =2", fF") =2, fe) =0 = f(a) = f(a) =... 


Higher derivatives, better approximation. The linear (or first-order) approxima- 
tion €,(x) = f(a) + f’(a)(a — a) “matches” f at x = a in the sense that 


f(a) =fa(a) and f"(a) = £,(a). 


Using second derivatives we can go a step further. The quadratic (aka second- 
order) approximation to f at x = ais a quadratic function that fits f even more 
closely than does @,: 


f(a) =qa(a) and f"(a)=q(a) and f"(a) = gy (a). 
If, say, f(x) = cos x and a = 0, then we have (as you know from calculus) 


f(0)=cos0=1; f’(0)=—sin0=0; f”(0)=—cos0=-—1. 
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Figure 4.2. Linear and quadratic approximations to the cosine function. 


Thus, €9(x) = 1+ 0x = 1, and basic calculus shows that go(2) = 1 — x?/2 
“agrees” with f in the desired sense: 


q(0)=1; (0)=1; f"(0)=—-1. 


(We will give a general formula for qq in the next section.) Given the extra match- 
ing derivative, we expect go to approximate f even better near x = O than does 
fo. Figure 4.2 doesn’t disappoint. 


Properties of Differentiable Functions 


Any function f for which f’(a) exists must be in some sense “nice” at x = a. 
The next theorem describes the most basic such form of good behavior. 


Differentiability and continuity. A function continuous at a point need not be dif- 
ferentiable there. The function h(x) = ||, for instance, is continuous but not 
differentiable at a = 0. (At all other inputs h is both continuous and differen- 
tiable.) But the converse assertion is true: 


Theorem 4.3 (Differentiability implies continuity). If f is differentiable at a, then 
f is also continuous at a. 


About the proof. To illustrate the idea, suppose f(3) = 42 and f’(3) = 7. To 
show continuity of f at « = 3, we need to show that lim,.3 f(a) = 42, or, 
equivalently, that lim,_.3 (f(z) — 42) = 0. The calculation involves a little 
trick: 


42 _, f(z)-42 ,. 
tn (F)— 42) — i, PE (2 — 8) = ty SS iy — 9) 
=7-0=0, 


as desired. The general proof is similar, and left as an exercise. 


We showed something very 
similar in Example 1. 
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Digging deeper. The following lemma delves a little further into good-behavior 
implications of differentiability. 


Lemma 4.4. Let f be a function defined in an open interval containing x = a, 
and suppose f'(a) exists. 


(i) If f’(a) 40, then f(x) F f(a) forall x F ain some interval (a—6,a+ 0). 


(ii) If f’(a) = 0, then—for every € > 0, no matter how small—there exists 6 > 0 
such that 


|f(z) — f(a)| < ela — al 


whenever |x — a| < 0. 


(iii) If f has a local minimum or local maximum at x = a, then f'(a) = 0. 
Proof: All parts follow from closer looks at the defining limit: 


x2—a Lr-a 


To prove (i), suppose f’(a) # 0 and set « = | f’(a)| > 0. Because the preceding 
limit exists, we can choose 6 > 0 so that 


f(x) ~ f(a) 


w—-a 


- Ha] <e=U"o) 


whenever |x — a| < 6. In particular, we must have f(x) # f(a) for all such 2; 
See for yourself. otherwise the inequality above fails. 
Claim (ii) essentially just restates the existence of the key limit. Because 
f'(a) = 0, we have 


iti JO aIO ~ aie 


za erZ—-a x@Z—a 


fa) = fl@)| 


w—-a 


By definition of the limit, for given « > 0 there exists 6 > 0 such that, whenever 
|x — al < 6, we have 


f(z) = fla) <e, or,equivalently, |f(x) — f(a)| <¢|x—al, 
L-a 
as claimed. 
We’ ll sketch a proof of (iii) assuming that f has a local minimum at x = a; 
Polishing the proof is an the proof for a local maximum is similar. In this case there is an open interval 
exercise. I =(a—06,a+6) with f(a) < f(x) for x € I. In particular, 
DOVE 6 sige ope DEI oa Gee: 
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0.004 F 


0.002 F 


0.000 F 


—0.002 F 


—0.004 


-0.4 -0.2 0.0 0.2 0.4 


Figure 4.3. The function f (2) = x — sina: very flat near x = 0. 


These inequalities imply, respectively, that 


lim i) Ne) >0 and lim 


tat r—a @w—a7 r= 6 


fla)-f@) — , 


— ? 


from which the only possible conclusion is that f’(a) = 0. 


EXAMPLE 5. If f(x) = sina and g(x) = x — sing, then f’(0) = 1 and 

g'(0) = 0. What does Lemma 4.4 say about f and g at a = 0? We'll prove these things in the 
next section. 

SOLUTION. Claim (i), applied to f, says that sinz # sin0 = O for x in 

some interval (—d, 6). This is no surprise—even the big interval (—7, 7) works. 

Claim (ii), applied to g, says something more interesting. If, say, « = 0.01, then 

(ii) says that for x in some interval (—4, 45), 


|e —sinz| < 0.01 |z]. 


Figure 4.3 illustrates graphically what this means—especially about how closely 
x approximates sin x near x = 0. The function g shows, finally, that the converse 
to claim (c) is false: Even though g’(0) = 0, g attains neither a maximum nor a 
minimum at 0. © 


Exercises 


1. Suppose f’(x) exists for all x. Every calculus student knows that if g(a) = 
f(x) + 3, then g'(a) = f’(a) for all a. Show this using Definition 4.1, 
page 173. 


2. Use Definition 4.1, page 173, to find each of the following derivatives, or 
to show that it doesn’t exist. 
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(a) f’(1) if f(a) = 

(b) g'(1) if g(x) = 2°. 

(c) h’(1) if h(x) = x*?. (Hint: The difference quotient reduces to a nice 
form.) 


(d) k’(1) if k(x) = 1/(a +3). 
(e) m!(1) if m(xz) = 42+ 2% +2 4+1/(x 4+ 3). (Hint: Use a fact about 


limits of sums.) 


3. (a) Use Definition 4.1, page 173, to show that if f(x) = 1/x andc 4 0, 
then f’(c) = —1/c?. 


(b) Prove by induction: (1/x”)’ = —n/ax”*+! for all integers n > 1. 
(Note: It is OK to use the product rule; we’ll prove it in the next 
section.) 


4. Assume in this problem that f : R — R is differentiable at x = 0. 


(a) Show that if f is strictly increasing, then f’(0) > 0. (Hint: Look at 
the sign of (f(a) — f(0))/(« — 0).) 
(b) Suppose that | f(2)| < |x| for all x. What can be said about f’(0)? 


5. Assume in this problem that f : R — R is differentiable at x = 0. 


(a) Show that if f(a) > f(0) for « > 0, then f’(0) > 0. (Hint: Consider 
lim, 0+ (f(x) — f(0))/(# — 9).) 

(b) Show that if f(a) > f(0) for x < 0, then f’(0) < 0. (Hint: Consider 
lim, —o- (f(x) — f(0))/(a — 9).) 


(c) What do the preceding parts say about a derivative at a minimum 
point? (We will explore this famous connection more carefully later.) 


6. Consider continuous functions f : R — Rand g: R — R, with f(0) = 
0 = f’(0). Show that fg is differentiable at x = 0, and (fg)’(0) = 0. (Use 
Definition 4.1; the product rule doesn’t apply.) 


7. Consider continuous functions f : R — R and g : R — R such that 
(i) f(0) = 0; Gi) f’(0) = 2; and Gii) g(0) = 3. Show that fg is differen- 
tiable at x = 0; find (fg)’(0). 


8. Consider the function defined by 


4.2. Calculating Derivatives 


(a) Show that f is continuous at x = 0. (Hint: Use left- and right-hand 
limits.) 


(b) Is f differentiable at x = 0? Prove your answer. 


. Suppose f(x) is differentiable at x = 2, with f’(2) = 3. Let g(x) = 


4 f(x) +5. Show that g is also differentiable at « = 2, with g’(2) = 12. 


. Suppose f(x) is differentiable at x = 2, with f’(2) = 3. Let g(x) = 


f(a +4) +5. Show that g(x) is differentiable at ¢ = —2. What is g’(—2)? 
Interpret the result geometrically. 


. Consider the function defined by 


af nei, 
wo={e fost 


(a) Show that f is continuous but not differentiable at « = 1. 


(b) Modify the definition of f as little as possible to make f differentiable 
atx =1. 


. Let f : R > R be any function such that | f (x)| < x? for all x. Show that 


f'(0) =0. 


. Let f : (—1,1) — R be any bounded function. 


(a) Define a new function g : (—1,1) — R by g(x) = xf (a). Show that 
g is continuous at x = 0. 


(b) Define a new function h : (—1,1) > R by g(x) = x? f(x). Show 
that h is differentiable at x = 0. 


4.2 Calculating Derivatives 


Familiar Rules 


Calculus veterans know well how to apply derivative rules (product, quotient, 
chain, etc.) in calculations like this one: 


4 
(2 sin(3x) + 4ina)' = 32 cos (3x) + 2x sin (3x) + -. 
x 


Our interest here is less in applying the rules—you’ve suffered enough—than 
in stating them precisely and proving them rigorously. After all, derivatives are 
limits, so algebraic properties of limits—like those in Theorem 3.3—will be key. 


Or even worse. 
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See page 151. 


We do need to avoid division by 
zero in combinations like f/ g. 
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Algebraic combinations. We have seen several ways to combine “old” functions 
f and g to form new ones. The combinations f + g, f — g, f -g, and f/g are 
constructed algebraically, while fog, go f, and f ogo f og involve composition. 
Propositions 3.11 and 3.12 say that if f and g are continuous at the domain points 
in question, then so are all the combinations just mentioned. 

We want to show now that similar properties hold for derivatives: if f and 
g are differentiable, and due care is taken with domains, then combinations are 
differentiable, too. We also expect that derivatives of combinations will be built, 
somehow, from derivatives of f and g. 


Theorem 4.5 (Derivatives of algebraic combinations). Suppose that f and g are 
functions differentiable at a; let C and D be constants. Then the functions 


$e CPD Gs “oe: tel 


are all differentiable at a (for f /g we need g(a) 4 0), with derivatives as follows: 
* Linear combinations: (Cf + Dg)’ (a) = C'f’(a) + Dg'(a). 
* Products: (fg)’ (a) = f'(a)g(a) + f(a)g'(a). 


/ 
: ae asa — g(a) f'(a)—f(a)g'(a) 
* Quotients: (4) (a) = = gia 
Notes on the theorem. Proofs follow or are left to the exercises. First, some 
informal observations: 


° Useful formulas—and more: The theorem not only justifies some well- 
loved techniques from elementary calculus but also guarantees that, under 
the given hypotheses, all indicated derivatives exist. This existence follows, 
as we will see, from corresponding properties of limits. 


¢ Sums and constant multiples: Both the sum rule and the constant multiple 
rule for derivatives are (easy) special cases of the result for linear combina- 
tions. Setting C = D = 1, for instance, gives one of these old favorites. 


¢ Linear combinations and linear transformations: The rule for linear com- 
binations can be phrased succinctly in the language of linear algebra: Dif- 
ferentiation is a linear transformation from one vector space to another. 
(Vectors, in this case, are functions.) 


° Approximate thinking: One view of the theorem concerns linear approxi- 
mation. By hypothesis, both f and g have linear approximations ¢,, and 
lg,a atx =a: 
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The theorem tells, in effect, how to combine yf, and £4, to create new 
linear approximations to functions like f + g, fg, and f/g. For example, 
the linear combination rule, says that for x near a we have 


3f(x) + 2g(x) © 3p .0(x) + 2€9,a(x) 
= (3f'(a) + 2g'(a)) (2 — a) + 3f (a) + 29(a), 


which should seem reasonable. Still more succinctly, we have €3+429,a = 
3L5,q + 2€g,q. The situation for products and quotients is a little more com- 
plicated; see the exercises. 


Proof: All parts follow from manipulating the limits that define the derivatives in 
question. We treat one part in detail and leave the rest to the exercises. 
For products, the theorem claims that 


ham £OOg(H) = flagla) 


rI—a rZ—-a 


= fla)g'(a) + f'(a)g(a); 


implicit, of course, is the claim that the limit exists. To see why, we manipulate 
the difference quotient limit, starting with a clever trick: 


f(x)g(@) = Fla)g(@) 


lim = lim 
LO r—-a ©2—a r—a 
= tim g(2) LOMO 4 him pq =H) 
= lim g(x) lim f(x) — fla) + lim f(a) lim g(x) = g(a) 
2a @2a r—a =2—a ane t—a 
= g(a) f'(a) + f(a)g'(a). 


Note especially the last step, in which we evaluated four limits, of which only one 
(the third) is completely trivial. The second and fourth limits define f’(a) and 
g'(a), which we’ve assumed to exist, and the first, lim, g(z) = g(a), holds 
because g is continuous at x = a. 


EXAMPLE 1. Finding derivatives of polynomials and rational functions, like 


3a’ —5a+2 


p(x) = 32" —5a+2 and q(x) = 
ex 


o] 


is easy for calculus veterans. How is Theorem 4.5 involved? 


f(@)g(@) = f(@g(@) + F@g(@) = fa)g(@) 
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Following minor 
re-arrangement. 


add and subtract the same thing 


algebra with limits 


more limit algebra 


Why? See Theorem 4.3, 
page 179. 
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Or prove ...it’s easy. 
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SOLUTION. Theorem 4.5 guarantees, first, that the derivative functions p’(z) 
and q’(x) exist. To prove that p(2)—or any polynomial function—is differen- 
tiable for all ~, we only need to observe that f(x) = x is differentiable, with 
f'(x) = 1 for all xz. Now Theorem 4.5 implies that power functions like 


are all differentiable in their own right, and that the familiar derivative formulas 
hold for all positive integer powers: 


(a?) 08, (23)! = 327, ave Gey aoe 


(In the exercises, we will discuss two proofs, one by induction.) 

We notice, too, that all polynomials are linear combinations of power func- 
tions, and so the theorem guarantees that all polynomials have derivatives. The 
function q(2)—like every rational function—is a quotient of differentiable func- 
tions, and so Theorem 4.5 guarantees differentiability except at roots of the de- 
nominator. For q there are three such roots; we will explore this function in more 
detail in the exercises. ©) 


EXAMPLE 2. What does Theorem 4.5 say about derivatives of f(a) = (2?+1)!° 
and g(x) = sinx/e"? 


SOLUTION. The product rule in Theorem 4.5 can help with f(a) if—but only 
if—we’re willing to multiply out the 15th power. That is tedious for a human but 
no big deal for, say, Mathematica: 


(07-41)? = 994-1507 4 10527% 44550744 13650774300382704---+1527+1. 


The result is easy, if laborious, to differentiate term by term. A wiser plan involves 
the chain rule; see below. 

Differentiating g requires the quotient rule, of course, but first we need deriva- 
tives of the numerator and the denominator. Although familiar from elementary 
calculus, the formulas 


(sinz)’=cosx and (e”)’ =e” 


are far from obvious, and we’ll defend them below. Assuming them for the mo- 
ment, the rest is easy. Since numerator and denominator are differentiable, and the 
denominator never vanishes, the quotient function is differentiable, with deriva- 
tive 


meee ree Sa ee © 


et ‘ er ex 


7 / . . 
sin x e”* cosx — e” sin x cos x — sinx 
ez a ; 
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Composition and the chain rule. Theorem 4.5 says nothing about functions like 
h(x) = sin(a?) and k(a) = sin(sin(sin(e”))), built by composition of differen- 
tiable functions. Theorem 4.6 assures us that such composites are indeed differ- 
entiable, and describes how the respective derivatives are combined. 


Theorem 4.6 (The chain rule). Suppose that g is differentiable at a and f is dif- 
ferentiable at b = g(a). Then f © g is is differentiable at a, and 


(fo9) (a) = f'(g(a)) - 9'(a). 


EXAMPLE 3. The functions f(x) = sina and g(x) = x? are differentiable for 
all x. Use the chain rule to differentiate 


fog(x) =sin(z”) and fo fo g(x) =sin (sin (x7)). 
SOLUTION. The chain rule applies directly to f o g: 
(fog) (a) = f'(g(a)) - 9'(a) = cos(a”) - 2a. 
For f o f og we have to dig deeper: 
(fo fog) (a) =f (fe g(a) (F29)' (a) 
=f" (fegla)): f(g(@))- 9'(@); 


we used the earlier calculation in the last step. Substituting the formulas for f and 
g gives the final answer: 


(fofeo 9g) (a) = cos (sin a’) . cos(a”) - 2a; 
the three-fold product represents two uses of the chain rule. S 


Here, before the proof, are some comments on the chain rule and why it is 
plausible. 


° Why multiply? The chain rule says that the derivative of a composition is 
a certain product of derivatives. Thinking of derivatives as magnification 
factors suggests why multiplication is the right thing to do. Suppose, say, 
that g’(a) = 2and f’(b) = 3. Then “lens” g and “lens” f magnify distances 
by factors of 2 and 3, respectively, and so we expect six-fold magnification 
on inputs sent first through g and then through f. 


It works for linear functions: We can just calculate, without fancy proofs, 
that the chain rule holds for linear functions f and g. If f(z) = Ar+ B 
and g(a) = Cx + D, then f’(2) = Aand g/(x) = C for all x, and 


fog(x) = Ag(a) + B= A(Cut+ D) = ACx + (AD+ B), 
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As always, due care for 
domains is necessary. 


We could substitute x for a, but 
why bother? 


We discussed the magnification 
view in the preceding section. 


Microscopes use the same 
principle. 


188 


Look at the first denominator on 
the right. 
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and so 
(f 0g)’ (x) = AC, 


as expected. 


° It works for nonlinear functions, too: Differentiable functions are closely 
approximated by linear functions, and so it is reasonable to hope that dif- 
ferentiable functions might satisfy the same chain rule. This turns out to be 
true; the formal proof explains why. 


Notes on a proof. We’re given that both limits 


_ g(x)— g(a) __, Pe A ett Ce 
im ———— =g(a) and pean = 


exist, where b = g(a), and we need to prove the related claim 


im LOC) = Fale) _ 


xz—b &—-a 


First we apply some clever algebra: 


f(g(a)) = flg@) _ f(g) = fla(@) , g(a) = g(a) 
) 


ra g(x) — g(a ca 
_ fy=f(0) g(a) ~ 9(@) 
y—b r-a ’ 


where y = g(x) and b = g(a). This looks promising. If x — a, then y > b 
(because g is continuous at a), and so 


lim 
ta L—a yb y—b @—a r—a 


just as we were hoping to show. 
This argument works just fine—and shows that the chain rule holds—for most 
functions f and g. But one technical flaw needs attention: The key factorization 


fan £9) =F) _ ,,, fF) = fl) |, ga) = sl) 


La c~t—a ©2—a g(x) _— g(a) @a r~r—a 


makes good sense if, but only if, g(a) 4 g(a) for all x # a in some interval 
containing a. For most functions g, this is the case, but there are exceptions. If g 
is constant, for instance, then our factorization is nonsense for all inputs x. But 
all is not lost: if g is constant, then f 0 g is also constant, and so, clearly, both 
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g'(a)=0 and (fog)’ (a) =0, 


and the chain rule still holds! 
We’re ready for the formal proof. As often happens, a special case takes some 
extra effort. 


Proof (of the chain rule): We treat two cases separately. 


Case 1: g'(a) # 0. In this case, g(x) # g(a) for all x # a in some interval 
containing a. We can therefore argue as above to obtain the chain rule. 


Case 2: g/(a) = 0. In this special case, we’ll show that (f 0 g)’ (a) = 0—again, 
as the chain rule claims. For notational simplicity, let’s assume that a = f(a) = 0. 
Our claim then is that, for any « > 0, we can find 6 > 0 with 


| ACEI) 


| <e or,equivalently, |f(g(x))| < €|a| 


whenever x € (—d, 6). 
To see this, let € > 0 be given; we can assume WLOG that € < 1. Note first 
that, because the limit 
f'(0) = lim £@) 


xz—-O0 
exists, the function f@) is bounded for x in some open interval (—0), 61). In 
other words, for some M > 0 we have 


a <M, or, equivalently, |f(a)| < M |a| 
x 


whenever x € (—d1, 61). We’ll assume, again WLOG, that M > 1. 
Next, because g’(0) = 0, we can choose 62 > 0 so that 


ae 


x 


<—_ or, equivalentl \g(x)| < : || 
=i > &q y> ONS FG I* 


whenever x € (02,62). Finally, let 6 be the smallest of 6), 62, and 1. This 6 
“works,” because if |x| < 6, then 


Ig(x)| < a Iz] <€ |e] <5 <5 <b, 


and so : 
If (9(2))| < M|9(2)| < MF; lel = ele]. 


This proves the chain rule. 
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See Lemma 4.4, page 180. 


It makes no difference 
mathematically. 


Challenging values of « are near 
zero. 
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We didn’t prove every detail 
rigorously. 


More details are in the 
exercises. 


We proved the first in 
Example 6, page 139. The 


second follows from the first. 
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Derivatives of Elementary Functions 


In Section 3.2, we discussed the “elementary functions” of calculus, built by com- 
bining basic function “elements”: power functions, trigonometric functions, ex- 
ponential functions, and their inverses. Proposition 3.10, page 150, says that every 
such elementary function is continuous throughout its natural domain. A similar 
claim holds for differentiability: 


Proposition 4.7 (Elementary functions are differentiable). Let f be an elemen- 
tary function defined on an open interval I containing a. Then f is differentiable 
at a. 


We’ve already done most of the work. Theorems 4.5 and 4.6 guarantee that a 


built-up function like 
Han a ) 


is indeed differentiable (except perhaps at endpoints of its domain), provided 
that its basic building blocks—including the sine, cosine, exponential, and log 
functions—are all themselves differentiable functions. Proving this rigorously 
for every basic elementary function takes some work. That isn’t our top priority, 
So we give only some brief samples here. 


EXAMPLE 4 (Trigonometric derivatives). All six standard trigonometric deriva- 
tive formulas follow from two basic limits: 
sinh cosh—-1 | 


lim =1 and lim =0. 
h—0 h h—0O 


How? Why? 


SOLUTION. All the remaining derivative formulas follow from the one formula 
(sin x)’ = cos x and from algebraic properties of sines and cosines. For example, 
we have 
' : T\\! T ‘ 
(cosa) = (sin («+ =) ) = cos (+ =) =-— sing; 


notice the chain rule implicit in the second equality, and two formulas relating 
sines and cosines. Here is another, this time thanks to the quotient rule: 


(tanz) = = —_.—_ = see" 2. 


cos? x 


: / : 
; sin x cos? x + sin? x i 
cos x 


To prove that (sina )’ = cos, we’ll use the well-known addition formula 
for sines: 
sin(a + h) = sin(x) cos(h) + cos(x) sin(h). 
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This will come in handy as we wrestle with the difference quotient: 


GHtay Si Se 


h-0 
_ jj sin(x) cos(h) + cos() sin(h) — sin(x) 
0 h 
=i . cos(h) — 1 . sin(h) 
= sin(z) lim A + cos(x) lim a 
= sin(x) -0 + cos(a) - 1 = cos(x), 
as desired. ©) 


EXAMPLE 5 (Exponential derivatives). The exponential derivative (e")’ = e” 
follows from the single limit 


How? Why? 
SOLUTION. Assuming the limit above, we have 


(e”)’ = lim ———— = lim ———— =e lim © 


as expected. ©) 


Derivatives of inverse functions. One loose end remains untied concerning dif- 
ferentiability of elementary functions: handling inverses of differentiable func- 
tions. (Logarithmic and exponential functions are inverses, for instance, as are 
f(x) = 2° and g(x) = ¥/z.) It turns out that inverses of differentiable functions 
are indeed differentiable—if the usual care is taken with domains of definition 
and to avoid division by zero. Here is the key result: 


Proposition 4.8 (Derivatives of inverse functions). Let f: I — Jandg: J — 


I be inverse functions, where I and J are open intervals. Suppose f is differen- 
tiable ata € I and f'(a) # 0. Then g is differentiable at f(a), and 


Figure 4.4 illustrates the idea. 
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f@r 


it 
f(a) a 


Figure 4.4. Derivatives of inverse functions f and g. 


Notes on a proof. If we assume for the moment that g is differentiable at f(a), 
the rest is easy, thanks to the chain rule. By definition of inverse functions, we 
have g(f(x)) = x forall x € I. Differentiating both sides gives 


W(f(x)) =x => g (F(a): f(a) =1 = g(f(@) = (a) 


whenever f’(a) 4 0. 

The harder part turns out to be showing what we assumed above: that g’(f(a)) 
exists. Figure 4.4 makes that assumption look reasonable, but proving it rigor- 
ously takes us a bit off our main course. We omit the detour. 


EXAMPLE 6 (Differentiating the natural log). Use Proposition 4.8 to differentiate 
Ing. 


SOLUTION. If f(a) = e® and g(x) = Ina, then we know f and g are inverses, 
and f’(a”) = e”. From Proposition 4.8, we have 


Writing b = e® gives the familiar formula g'(b) = (Inb)’ = ¢. ?) 
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Exercises 


1. We said in this section that f(z) = a” for any positive integer n, then 
f'(x) = na”—!. For one inductive proof (using the product rule) see Prob- 
lem 11, page 46. 


Another approach is to work directly with the definition: 


Fe) = jin A be 


(a) Note that x?—a? = (a—a)(x+a) and x3—a3 = (x—a)(x?+ar+<a”). 


Find (proof optional) an analogous formula for x” — a”. 


(b) Use your formula from (a) to prove the derivative rule. 
2. Let f, g, and h be everywhere differentiable functions. 


(a) Use the product rule to explain why (fgh)’ = f’gh + fg'h+ fgh’. 
(b) State a “three-deep chain rule” for the composition f og oh. 

(c) State a formula for the second derivative (f 0 g)” (a). 

(d) State a “product-chain hybrid” rule for (f 0 (gh))’ (a). 


(e) Consider the special case where f(x) = Ax + B for some constants 
A and B. What is (f © g)'(a)? What does this have to do with the 
linear combination rule (Theorem 4.5, page 184)? 


3. Suppose f and g are both differentiable at x = a, with derivatives f’(a) 
and g'(a). Use properties of limits to show that the linear combination 
h(a) = 3f(x) — 42g(z) is also differentiable at a. What is h’(a)? 


4. Prove the claim about linear combinations in Theorem 4.5, page 184. 


5. This problem outlines a proof—assuming both the product rule and the 
chain rule, which we proved separately—of the quotient rule part of The- 
orem 4.5, page 184. Assume throughout that f and g are differentiable at 
x = aand that g(a) 4 0. 


(a) Assume (it is easy to show) that the function k(a) = 1/2 is differen- 
tiable for all « A 0, with k’(a) = —1/zx?. Use this and the chain rule 
to find h’(a), where h(x) = 1/g(z). 

(b) Apply the product rule to f(x)/g(a) = f(a) - 1/g(a) to deduce the 
quotient rule as stated in Theorem 4.5. 


6. Use Theorem 4.5 and the fact that (sin x)’ = cos x to find derivatives of the 
other five standard trigonometric functions. 


193 


194 4. Derivatives 


7. Find derivatives of the following functions (assuming that derivatives of 
sin x, cos x, and e” are known). 


(a) sin (cos (e”)) 


(b) sinz - cosx - e” 
sin x 
Ce 
cosx: € 
sin x 


(d) 


cos (e”) 
8. Consider the function 


3a’ —5a+2 
LE Dh: 


The quotient rule guarantees that q is differentiable except at zeros of the 
denominator. 


(a) Use technology to plot g; notice the two vertical asymptotes. How are 
they related to derivatives? 


(b) The number one is a root of the denominator, but the graph of q has 
no vertical asymptote there. Why not? 


(c) The value q(1) is undefined in the formula above. What value of g(1) 
makes q differentiable at x = 1? Why? 


9. Let f : R — R be everywhere differentiable, with f(a) = 0, and let 
g = f?. Show that g(a) = g’(a) = 0. 


10. A function f is said to vanish to order k at x = aif f(a) = 0 = f’(a) = 
f(a) =--- = f-). Suppose that f vanishes to order three at 2 = a and 
g vanishes to order four. What can be said about the product function fg? 
Can you prove your answer? (Hint: What does the product rule say about 
higher derivatives, like (fg)”, (fg)’”, etc.?) 


11. Let g : R — R be everywhere differentiable, with g’(n) = 0 for all inte- 
gers n. 


(a) Let f : R — R be another everywhere differentiable function. Show 
that (f o g)’(n) = 0 for all integers n. 


(b) Let h(x) = cos(ma) - g(x). Show that h’(n) = 0 for all integers n. 
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4.3. The Mean Value Theorem 


The mean value theorem (MVT) is a marquee result of real analysis. As the inter- 
mediate value theorem (IVT) and the extreme value theorem (EVT) do for con- 
tinuous functions, the mean value theorem addresses a natural and fundamental 
property of differentiable functions. 


Theorem 4.9 (The mean value theorem). Suppose f : [a,b] — R is continuous 
on |a, b] and differentiable on (a,b). Then there exists c € (a,b) such that 


F(0) = Fa). 


fo- = 


Like the IVT and the EVT, the MVT says that a certain function—/’, this 
time—assumes a certain “mean” (or “average”) value. Unexciting as this might 
seem, it turns out to be a big deal, with surprisingly broad and deep implications. 
We explore several before proving the theorem. 


The graphical view. The MVT equation says something about s/opes on the 
graph of f: At some point c between a and b, the tangent line is parallel to the 
secant line joining (a, f(a)) and (0, f(b)). Figure 4.5 illustrates this. The (linear) 
secant line function, labeled L(a) in the picture, will play a role in the proof. 


The horizontal case: Rolle’s theorem. If we add to the other hypotheses the 
requirement that f(a) = f(b), then the conclusion takes a slightly simpler form: 


Theorem 4.10 (Rolle’s theorem). Suppose f : [a,b] > R is continuous on |a, b] 
and differentiable on (a,b), and that f(a) = f(b). Then f'(c) = 0 for some 
c € (a,b). 


ff) 


L(x) 
S(b)r 


S@>P 


Figure 4.5. The mean value theorem illustrated. 
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More than one possible c is fine 
with Rolle. 


The units might be miles and 
hours. 
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fO) 


fOr 


Figure 4.6. Rolle’s theorem: two possible c-values. 


In graphical terms, the new hypothesis says that the graph has the same height 
at x = a and at x = 5; the conclusion says that the graph has a horizontal tangent 
somewhere in between. Figure 4.6 shows two such horizontal tangents. 


Car talk. If f(t) is the position of a car at time t, then (f(b) — f(a))/(b — a) 
is the car’s average velocity over the time interval [a,b], and f’(t) is the car’s 
instantaneous velocity at time t. At some intermediate time c, says the MVT (and 
common sense), these two velocities must be equal. Rolle’s version sounds even 
more intuitive: if a car starts and ends at the same position, then it must be stopped 
at some instant in between. 


When is a function constant? It is obvious, and easy to prove, that a constant 
function has zero derivative. The converse—a function with constant zero deriva- 
tive is constant—is less obvious than it might seem. For example, the function 


1 ifz >0 


ae ee 


is nonconstant, but s’(2) = 0 for all « 4 0. If we rule out such “disconnected” 
domains, the converse is true—but the proof requires the MVT. Here are the de- 
tails. 


Proposition 4.11. Let I be an open interval and f : I — Ra function. If 
f(a) = O for all x € I, then f is constant on I. 


Proof: Let s and t be any two points in J. By the MVT, there is a point c between 
s and t for which 
f(s) = fF 


f(g- 


Since the left side is zero, so is the right, which means f(s) = f(t), as desired. 


4.3. The Mean Value Theorem 


Proving the Mean Value Theorem 


Proving the MVT takes some effort—not unexpectedly, given the theorem’s im- 
portance and usefulness. Fortunately, we’ve already done most of the hard work. 
We’ll argue in three short steps: 


* Step 1: Show that if f : [a,b] — R is as hypothesized, and f has a local 
maximum or local minimum at c € (a,b), then f’(c) = 0. 


© Step 2: Use Step 1—and another famous “value theorem”—to prove Rolle’s 
theorem. 


° Step 3: Apply Rolle’s theorem in a clever way to deduce the MVT. 


About Step 1. Our Step 1 claim should be familiar, for at least two reasons. 
First, it lurks behind most of those classic maximum—minimum problems of ele- 
mentary calculus. Second, we’ve already proved it (and a bit more), as part (c) of 
Lemma 4.4, page 180. Let’s move on. 


About Step 2. We assume that g : [a,b] — R is continuous on [a, b] and differ- 
entiable on (a, b), and that g(a) = g(b); we need to show that g’(c) = 0 for some 
c € (a,b). 

To apply Step 1, we invoke the EVT: Because g is continuous on [a, }}, it as- 
sumes both maximum and minimum values somewhere on that closed interval. If 
both the maximum and the minimum are assumed at endpoints, then the maxi- 
mum and minimum values are equal, and so g is constant. In this case, g’(c) = 0 
for all c € (a,b), and we’re done already. The alternative is that at least one of the 
maximum and the minimum is attained somewhere in (a,b). In this case Step 1 
applies, and again we’re done. 


About Step 3. Rolle’s theorem is more than a junior-grade version of the MVT. 
With a little ingenuity, the latter can be deduced from the former. The trick is 
to apply Rolle’s theorem not to the function f given in the MVT, but to another 
function g, cleverly contrived from f. We sketch the argument, leaving details to 
the exercises. 


1. Let L(x) be the linear function whose graph joins the points (a, f(a)) and 
(b, f(b)); see Figure 4.5. Notice some properties of L: 
b) — 
(i) L(a) = f(a); Gi) L(b) = f(b); Gii) L'(x) = Oy io) for all x. 


2. If we set g(a) = f(x) — L(x), then g(a) = 0 = g(b). By Rolle’s theorem, 
there exists c € (a,b) with g’(c) = 0. This c satisfies the conclusion of the 
MVT, and the proof is complete. 
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Find the largest rectangular 
pigpen ...one side is a river .... 


We use g, not f, to avoid 
confusion in Step 3. 


The extreme value theorem; 
see page 157 


A common one in analysis. 
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fneed not be strictly increasing. 


See Proposition 3.6, page 141. 
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Calculus Friends Revisited: Using the MVT 


Standard ideas of elementary calculus link properties of a function f to those of its 
derivatives f’ and f”. Using the MVT, we can prove many such facts—and some 
but not all of their converses. The following proposition offers some samples. 


Proposition 4.12. Suppose that f : I — R is differentiable at all points of an 
open interval I. 


(i) f’(a) > O forall x € I if and only if f is increasing on I. 
(ii) If f'(x) > O for all x € I, then f is strictly increasing on I. 
(iii) If f'(x) 4 O for all x € I, then f is monotone on I. 


Proof (s): For the “only if” part of (i), suppose that f’(x) > 0 for all x € I. To 
show f is increasing, let s and t be in J, with s < t. By the MVT, we have, for 
some c between s and t, 


f(t) = f(s) 


ar ei f'(c)_ or, equivalently, f(t) — f(s) = f’(e)(t—s). 


By hypothesis, the rightmost quantity is nonnegative. This implies that f(t) > 
f(s), which means that f is increasing, as claimed. 
For the “if” part of (i), suppose that f is increasing on J. For any x € I, we 


have 
im LO= Ff) 


’ 
tax t-—<x toat t-—2x 


where the first equality holds by definition of f’(a) and the second by a familiar 
property of limits. Because f is increasing, we have 


f= f(a) 


t—2x 


>0 
for all t € I with t > x. Thus f’(z) is the limit of a nonnegative expression, and 
so must itself be nonnegative. 

This completes the proof of (i); we leave remaining parts to the exercises. 


Onward and Upward with the MVT 


The mean value theorem has many and varied uses. Here we’ll briefly glimpse 
two of them. 


4.3. The Mean Value Theorem 


Darboux’s theorem: derivatives and intermediate values. The intermediate value 
says that a continuous function f : J — R defined on an interval J has the “inter- 
mediate value property”: 


If s and t are in I and f(s) < v < f(t), then f(c) = v for some v 
between s and t. 


It was widely believed in the nineteenth century that functions with the interme- 
diate value property must be continuous. 

This is false. In the late 1800s, Jean Gaston Darboux proved (essentially) the 
following theorem (a proof, based on the mean value theorem, is outlined in the 
exercises): 


Theorem 4.13 (Darboux’s theorem). Suppose f : I — R is continuous on |a, b] 
and differentiable on (a,b). Then f' has the intermediate value property. 


Another well-known fact—also proved in exercises—is that a derivative func- 
tion (f’ in the notation above) need not be continuous. 


EXAMPLE 1. Consider the function f(x) = x? sin(1/z); set f(0) = 0. Discuss 
the derivative f’. 


SOLUTION. For x 4 0 the function is clearly differentiable; standard derivative 
rules apply. At x = 0, the derivative is also differentiable, but discontinuous. 
Details will be explored in the exercises. ©) 


Taylor’s theorem: the MVT generalized. The mean value theorem equation can 
be rewritten to say that, under the appropriate hypotheses, we have 


fe) = f(a) + f(OO- a) 


for some number c between a and b. This equation can be thought of as saying that 
f(b) approximates f(a) with error no larger than the last term. Taylor’s theorem, 
dating from around 1700 and named for the Scottish mathematician Brook Taylor, 
offers a more powerful version of such an approximation. Here is a special case: 


Theorem 4.14 (Taylor’s theorem, n= 3). Suppose f, f', f”, and f’ all exist 
and are continuous on |a, b]. Then, for some input c in [a, b], we have 


£00) = Fa) + F(ay(o~ a) + LM ~ ay + ay 


A proof is outlined in the exercises. Here is an application. 


EXAMPLE 2. What does Taylor’s theorem say about f(2) = cosx fora = 0 
and b = 1? 
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SOLUTION. Applying Taylor’s formula gives 


cos(0) ae 


cos 1 = cos(0) — sin(0) - 1 — 5 rl 5 5 


Now, for c € [0,1] we have sinc < sinl & 0.84, so the last summand above 
can’t exceed 0.14. This means that cos 1 differs from 0.5 by less than 0.14, which 
is easily checked to be true. (In fact, cos1 * 0.54.) So Taylor’s theorem might 
help us calculate cosines in an emergency. © 


Exercises 


1. Let f : R — R bea differentiable function, and suppose that f has n real 
roots. Use Rolle’s theorem to show that f’ has at least n — 1 real roots. 
Give an example to show that f’ may have more than n — 1 real roots. 


2. (a) Use algebra to show that p(a) = x? + x has exactly one (real) root. 


(b) Use Rolle’s theorem to show (again) that p(a) = x + x has exactly 
one root. (Hint: If there are two roots, Rolle’s theorem leads to a 
contradiction. ) 


(c) Show that q(7) = a8 +2" +4a9+274+2°+2° +2 has exactly one 
root. 

(d) Show that r(x) = 248 +a4%+a9+ a7 +2°4 23 +24 987, 654, 321 
has exactly one root. 


3. Let f(x) = x? + ax + b, where a and b are fixed real numbers. 


(a) Explain why f(a) must have at least one root, regardless of the values 
of a and b. 


(b) Give examples (i.e., specific values of a and b) to show that f(x) can 
have one, two, or three roots. No formal proofs needed, but say briefly 
why your examples work. 


(c) Prove that f can have no more than three roots. (Hints: (i) Take it as 


given that a quadratic polynomial can have at most two roots; (ii) use 
Rolle’s theorem.) 


(d) Show that if a > 0, then f has exactly one root. 


4. Use Rolle’s theorem to show by induction that a nonconstant polynomial p 
of degree n has at most n roots. (Hint: If p has degree n, then p’ has degree 
n—1.) 


5. Suppose f : [a,c] > R has derivatives f’ and f”, both continuous on [a, db}, 
and assume that there exists b in (a,c) with f(a) = f(b) = f(c). 
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(a) Show that there exists x; and x witha < 21 < xq < cand f’(21) = 
f'(x2) =0. 

(b) Show that there exists xq with a < x9 < cand f” (ao) = 0. 

(c) State a generalization of these hypotheses that guarantees there exists 
ao with f(4?) (a9) = 0. 


6. Let f : R — R have continuous derivatives f’ and f”, and suppose that 
f(0) =0, f(1) = 1, and f(2) = 2. Show that f” (ao) = 0 for some xo in 
(0, 2). 


7. Suppose that f : R — R is periodic with positive period a (i.e., f(a+a) = 
f(a) for all x), and that f has continuous derivatives of all orders. Show 
that f’, f”, f’”, ...all have infinitely many roots. 


8. Suppose f : R — R has first and second derivatives f’ : R — R and 
f": ROR. 


(a) Show that if f’(a) > M for all x in an interval J, then f(b) — f(a) > 
M(b— a) for all inputs a and b in I with b > a. 

(b) Suppose f(0) = 0, f’(0) = 0 and f” (x) > 2 for all z > 0. Show 
that f(x) > x? forall xz > 0. 


9. In each part, find all possible values of c as described in the MVT. 


x +5; [a,b] = [1,7]. 
x? + 5a +7; [a,b] = (1, 7]. 
a lao) =" [0sl. 


(d) f(x) = 3x? + 5a + 7; [a, b] = [a, b] (the answer involves a and b). 


(e) If f is any quadratic function and [a, b] any interval, then the MVT 
equation holds with c taken to be the midpoint of [a,b]. Make this 
precise and prove it. 


10. In each part following, decide whether a value of c can be found as de- 
scribed in the MVT. If so, find one. If not, say which of the MVT hypothe- 
ses is not satisfied. 


(a) f(z) =|a]; [a,b] = [-1,]]. 
(b) f(x) = |x? -—10x+21|; [a,b] = [4,6]. 
(c) f(x) =|? - 102421]; [a,0] = [2, 4]. 
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Forget the C on a test and lose 
a point. 


11. 


15. 
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Suppose f(t) is the eastward velocity of a car at time ¢, with all quanti- 
ties measured in appropriate units. What does Rolle’s theorem say in this 
setting? (Hint: f’(¢) measures eastward acceleration.) 


. One hypothesis of the MVT is that f : [a,b] — R be continuous on all of 


(a, b]. Give an example (as simple as possible) to show that this hypothesis 
is necessary—i.e., find a function f : [a,b] — R, differentiable throughout 
(a, b), for which the MVT fails. 


. Suppose f and g are functions continuous on [a,b] and differentiable on 


(a, b). 


(a) Use results in this section to show that if f’(a) = g’(x) for all a € 
(a, b), then f(a) = g(a) -+C for some constant C. (This is sometimes 
called an identity theorem for differentiable functions.) 


(b) Suppose that f’(2) = g’(a) +5 for all x € (a,b). What can be said 
about f(a) and g(x)? Why? 


. Antiderivative tables in elementary calculus books are full of formulas like 


fcoscdz =sina+C. 


(a) Why exactly is the C there? What does this have to do with the MVT? 
(b) A very fussy professor might find fault with the formula [ de = 
In |a| + C. Explain. (Hint: Consider domains.) 


Give a detailed proof of Step 3 in the proof of the MVT. In particular, give 
an explicit formula for L(x) and use it to verify the other claims. 


. This problem is about various parts of Proposition 4.12, page 198. 


(a) State (don’t prove) versions of (i) and (ii) that involve the word “de- 
creasing.” 


(b) Prove (ii) carefully; mimic the proof of (i). 


(c) Prove (iil). 


. Let f : R — R be differentiable for all x, with f’(x) < 1 for all x and 


f(0) =0. 


(a) Show that f(x) < x forall x > 0. 
(b) Does a similar inequality hold for x < 0? Explain. 


. Let f : R — R be differentiable for all x. Suppose that f’(x) > 1 for all z. 


Show that the graph of y = f(a) can intersect the graph of y = x no more 
than once. 
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20. 


21. 


22. 


23. 


24. 


. Let f : R > R be differentiable for all x. Suppose that f’(x) < 1 for all x 


and that f(0) = 0. Show that f(a) < x for all x > 0. Must f(x) < a hold 
also for x < 0? Why or why not? 


Let f : R — R be differentiable for all x. Suppose that f’(x) > 1 for all zx. 
Show that the graph of y = f(a) can intersect the graph of y = x no more 
than once. 


A function f : R — R is called a contraction if the inequality 


If(z) — f(y) < |e — yl 
holds for all real numbers x and y. (As the name suggests, a contraction 


“shrinks distances.’’) 


(a) Show that if f is a contraction, then f is uniformly continuous on D. 


(b) Suppose that f : R — R is differentiable, and that | f’(x)| < 1 for all 
x. Show that f is a contraction. (Hint: MVT.) 


(c) Is f(a) = sin x a contraction? Is g(x) = e® a contraction? Why? 
Prove Darboux’s theorem (Theorem 4.13) as follows: 


(a) Suppose first thata < s < t < b, and f(s) <0 < f’(t). Show that 
f must attain a minimum at some point c strictly between s and t. At 
this c, f’(c) = 0. 

(b) Now suppose that a < s < t < b, and v is any number such that 
f(s) < v < f’(t). Consider the new function g(x) = f(a) — va. 
Then g satisfies the conditions of the previous part. Apply the result 
of (a) to complete the proof. 


In the situation of Example 1, find a (two-part) formula for f’, and show 
that f’ is not continuous at x = 0. 


Prove Taylor’s theorem (Theorem 4.14) in the special case a = O and b = 1, 
as follows: 


(a) Consider the function 


Show that f(0) = p2(0), f"(0) = p4(0), and ”(0) = p$(0). 
(b) Consider the new function 
g(t) = f(t) — pa(t) + (pa(1) — FQ). 
Show that g(0) = 0 = g(1), so there exists ¢ in (0, 1) with g’(t1)=0. 
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(c) Show that g'(0) = 0 = g’(t1), so there exists t2 in (0,t,) with 
g" (t2) = 0. 

(d) Show that g’”(0) = 0 = g’(t2), so there exists c in (0,t2) with 
g’"(c) = 0. Conclude that Taylor’s theorem holds in the case at hand. 


25. Prove Taylor’s theorem (Theorem 4.14) in the general case (i.e., without 
assuming a = 0 and b = 1). (Hint: In the general case, use the given 
function f to define a new function h(a) = f(b(a — a)).) 


4.4 Sequences of Functions 


In Chapter 2, we studied convergence for a sequence {a,,} of numbers—under 
appropriate conditions, we say that the numbers a,, converge to another number a. 

A sequence { f,, } of functions may also converge or diverge to a limit function. 
We explore some basic examples before giving formal definitions. 


EXAMPLE 1. Consider the functions f\(7) = 2, fo(z) = 27, f3(2) = 2°, 
...5 fn(@) = x”. Figure 4.7(a) shows graphs of several f,, on the interval [0, 1]; 
Figure 4.7(b) shows one typical function, f29. Does the sequence { f,, } converge 
to some limit function? 


SOLUTION. As Figure 4.7(a) hints, the behavior of a sequence {f,,} depends 
strongly on the domain we specify. Viewed on the interval (0, 0.8), for instance, 
the graphs of f,, appear to become flatter and flatter, approaching the function 
f(z) = 0. At 2 = 1, on the other hand, we have f,,(1) = 1 for all n; graphs of 


(a) fn(a) = «” for many n (b) foo(x) = 220 


Figure 4.7. Exploring the function sequence { f(a) = x”}. 


4.4. Sequences of Functions 


(a) gn (x) for many n (b) g20(a) = x + sin(20z)/20 


Figure 4.8. Exploring the sequence {gn(x) = x + sin(nx)/n}. 


all the f,, pass through the point (1,1). On the domain interval [0, 1], therefore, 
the f,, appear to converge—in some sense yet to be defined—to the function 


ieee eae 
no={" pai, 


We also observe that, although each f,, is continuous on (0, 1], the limit function 
f is discontinuous at x = 1. © 


EXAMPLE 2. Consider the functions 


sin(nx) 
ars) 


9n(x) = = 1,253,253 


Figure 4.8(a) shows several g,, on the domain [—5, 5]; Figure 4.8(b) shows only gao. 


Does the sequence {g,,} converge to a limit function? 


SOLUTION. The graphs suggest—correctly—that the g,, although wavy for 
large n, approach the linear function g(x) = x. The formulas agree—for each 
n, the inequality 


1 
<a 


n 


sin(nz) 


\gn(x) — g(x)| = | 


holds for all inputs x in [—5, 5], the domain interval shown. By contrast with 
the situation in Example 1, moreover, the pictures suggest that the same sort of 
convergence occurs on every domain interval. © 


What the examples show. The examples illustrate that a sequence of functions 
may behave quite differently on one domain than on another. On the domain 
(0, 1/2], for example, the functions { f,,} in Example 1 converge to the zero func- 
tion. On the domain (0, 1], too, these functions appear to converge, but to a limit 
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Not shown in Figure 4.7, but 
easily visualized. 


We could have used any 
domain interval. 
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function that is discontinous at x = 1. On the domain [2, 3], by contrast, the same 
functions become larger and larger with n, and so have no sensible limit func- 
tion. The functions {g,,} in Example 1, on the other hand, appear to converge to 
g(x) = x on every domain interval. 


Two Definitions 


Let fi, fo, f3, ... be a sequence of real-valued functions, all defined on a fixed 
interval J. To sort out similarities and differences in the behaviors illustrated in 
Examples | and 2, we’ll define two senses in which { f,,} might converge to a 
limit function f, also defined on J. 


Definition 4.15 (Pointwise convergence). The sequence {f,,} converges point- 
wise to f on J if, for every fixed x € I, we have f,(%) — f(x) asn — oo. 


Definition 4.16 (Uniform convergence). The sequence { f,,} converges uniformly 
to f on I if for each € > 0 there exists N such that for every n > N the inequality 
|fn(x) — f(a)| < €holds for all x € I. 


How do the sequences in Examples | and 2 stack up against these definitions? 


* Both sequences converge pointwise: Both { f,,} and {g,} converge point- 
wise to their limits, at least on the domain intervals shown. For { f,,} we 


have 
lim fn(z) = lim «” =0= f(a) 


n—Co n—-oco 


for all x € [0,1). If = 1, by contrast, then 


lim f,(1) = lim 1" =1= f(1). 


n—Cco n—- oo 


Thus { f,,} converges pointwise to f on [0, 1], as claimed. 


For {gy}, the squeezing inequality 


1 sin(nx) 1 
2--— S$ 0+——— s2r+-, 
n 


which holds for all n and every fixed x, implies that 


Him. gn(e) = im (+ @EE)) — 2 = g(r) 


noo noo n 


for all x € [—5, 5]. Thus {g,,}, too, converges pointwise to a limit function 
g, this time on the domain [—5, 5]. 
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* Only one sequence converges uniformly: Both {f,} and {g,} converge 
pointwise to their limit functions; {g,,} also converges uniformly. Show- 
ing this is straightforward. For given € > 0, the number NV = 1/e works: if 
n > N, then we have 


1 


lon(a) — o(2)| = [PPOD) <2 cs ae, 


n 


and the inequality holds for all x in [—5, 5], as the definition requires. 

The sequence { f,,} does not converge uniformly to f on [0,1]. The prob- 
lem, roughly speaking, is that the limit function f “jumps” abruptly at 
x = 1, while all the f,, increase smoothly. We explore this idea more 
carefully in the next example. 


EXAMPLE 3. The functions f,(a) = x” converge pointwise on [0, 1] to the limit 
function f, with f(z) = 0 for x # 1 and f(1) = 1. Show that {f,,} does not 
converge uniformly to f on [0, 1]. 


SOLUTION. Suppose toward contradiction that { f,,} does converge uniformly. 
Set « = 0.1, choose N as in Definition 4.16, and consider the function fyy4+1. 
Then, by design, 

lfv4i(x) — f(a)| < 0.1 


holds for all x € [0,1]. Because f(1) = 1 and f(x) = 0 for x ¥ 1, the preceding 
inequality boils down to two conditions: 


@) |fvai(e)|<01 if@41; Gi |fnai(1)—1)<0.1. 


In particular, we must have either fy4i(a) < 0.1 or fyyi(x) > 0.9 for all 
x € [0,1]. This is clearly absurd—the continuous function fy4i(2) = xt! 
must assume every value between zero and one. We’ve shown that uniform con- 
vergence fails. ©) 


On the other hand... 


EXAMPLE 4. Show that the same sequence {f,,} does converge uniformly on 
(0, 0.99], with limit f(a) = 0. 


SOLUTION. Let € > 0 be given. Choose a number N such that 0.99% < ¢. Any 
such N works, because for all n > N and all x € [0, 0.99], we have 
lfn(w) — f(x)| = |x” —0| = 2” < 0.99" < 0.99% <e, 


as desired. ©) 
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This is the same limit function 
as before, but chopped off at 
x= 0.99. 


Why can this be done? How big 
is N if e = 0.001? 


208 


Consider {1/n}, for instance. 


And differentiable, for that 
matter. 
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Basic properties. Proposition 4.17 collects, as samples, some basic properties of 
pointwise and uniform convergence. Proofs and additional properties are left as 
exercises. 


Proposition 4.17. Let I be an interval, and let {f,,} and {gn} be sequences of 
real-valued functions, all defined on I. 


* Uniform convergence is stronger: If f,, — f uniformly, then f, — f point- 
wise, too. The converse is false. 


° Algebraic combinations: Suppose fn — f and gn — g pointwise on I, 
where f and g are functions. Then 


fn = Yn fg and fngn— f9; 
pointwise on I in each case. 


© Patching it together: Let I, and Ip be intervals with I = I, Ul. If fn - f 
uniformly (or pointwise) both on I, and on Iz, then fp, — f uniformly (or 
pointwise) on I. 


Properties of Limit Functions 


Must the limit of a sequence “inherit” properties from members of the sequence? 
For a sequence {x,,} of numbers, for instance, we know that if x,, > 0 for all n 
and x, — L, then L > 0, too. On the other hand, the limit of a strictly positive 
sequence need not be positive. Similar questions arise for sequences of functions. 


EXAMPLE 5. Let { f;,} be a sequence of functions defined on an interval J, and 
suppose f, — f pointwise on J. If f,(a) > 0 for all x € J and for all n > 0, 
must f(x) > 0 for all a € J? If all the f,, are continuous on J, must f be 
continuous on J, too? 


SOLUTION. The answers are respectively yes and no. If we fix any x in J, 
then {f,(x)} is a sequence of nonnegative numbers, and so its limit, f(x), is 
also nonnegative, as claimed. As for continuity, we saw in Example | that the 
continuous functions f,(~) = x” converge pointwise on [0, 1] to a discontinuous 
limit—the function f defined f(x) = Oif « # O and f(1) =1. ©) 


So the pointwise limit of continuous functions need not be continuous. The 
situation is different for uniform limits: 


Proposition 4.18. Ler { f,,} be a sequence of functions, converging uniformly on 
an interval I toa function f. If all the fp, are continuous on I, then f is continuous 
on I, too. 
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The proof idea. The formal proof is clever and slightly technical, but the basic 
idea is straightforward. Continuity of f at x = a means, roughly, that if x ~ a, 
then f(a) ~ f(a). This holds in the present case because we know (i) f(a) © 
f(a) for large n and for all x in T; (ii) fn(w) © fn(a) when x & a because f, is 
continuous at x = a. Putting these conditions together gives, for x = a, 

f(@) ® fn(@) © fr(@) © fla). 


We make this three-step approximation precise in the formal proof. 


Proof: Let a be any point of J. To show that f is continuous at a, let € > 0 be 
given. To find a 6 > 0 that “works” for ¢ in the definition of continuity, we argue 
in three steps. 

First, we choose any particular integer N such that 


|f(a) — fu(x)| < 7 for all x in I; 


this is possible because { f,,} converges uniformly to f. Second, because fy is 
continuous at x = a, we can choose 6 > 0 such that 


|fn(xz) — fn(a)| < 7 whenever |x —a| <6. 

Third, we note that, since a € I, our choice of N guarantees 
€ 
Ifw(a) — fl@)| < £. 


Putting these inequalities together shows that our chosen 6 does what we need it 
to do: if |x — al < 6, then 


[f(x) — F(@)| < |F(2) — fr (@)| + [fv (@) — fr (| + fv @ — F(@)| 


a ae 
coal ae tae 


as desired. 


Derivatives and limits. The uniform limit of a sequence of continuous functions 
is continuous, as just shown. Can anything be similar be said about differentiable 
functions? 

As Example | illustrates, the pointwise limit of differentiable functions need 
not even be continuous, let alone differentiable. The following example shows 
that the uniform limit of differentiable functions need not be differentiable, either. 


EXAMPLE 6. For each positive integer n, define f,(x) = Es aaa Figure 4.9 
shows several of the f,,; the sequence appears to approach the absolute value 
function. What is happening with derivatives? 


Watch for three «/3’s. 


By a three-part triangle 
inequality. 
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See the exercises. 


Again, see the exercises. 
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a4 “05 "05 1 


Figure 4.9. Smooth functions approaching the absolute value function. 


SOLUTION. The functions f,, behave as Figure 4.9 suggests. Writing f(z) = 
|x|, we can readily show: 


* Each f,, is differentiable at all real x. This is clear for x 4 0, and not hard 
to prove if x = 0. 


* fn — f uniformly on |—1,1]. This follows from the fact, readily shown, 
that 


Ifa) — fa(@)| <= 
for all x € [—1, 1]. 


The point is that, although all the f,, are differentiable everywhere, their limit f is 
not differentiable at the origin. Thus, differentiability—unlike continuity—is not 
preserved by uniform limits. © 


A Function of Weierstrass: Everywhere Continuous 
but Nowhere Differentiable 


One use of function limits is in exploring how “good” and “bad” behaviors can 
coexist in the same function. In Example 6, for instance, we obtained the absolute 
value function—continuous everywhere but not differentiable at one point—as the 
uniform limit of a sequence of differentiable functions. 

Using similar but more sophisticated methods, the mathematician Weierstrass 
constructed around 1872 a much stranger limit function: one that is continuous 
everywhere but differentiable nowhere on R. Weierstrass’s construction starts 
with a function sequence of the following form: 


4.4. Sequences of Functions 


(a) fo and fi: first steps (b) fo: already ragged 


Figure 4.10. Steps toward Weierstrass’s function. 


fi(x) = cos(x) + a cos (bra) 


f2(x) = cos(x) + a cos (bmx) + a? cos (b’72) 


fn(x) = 3 a™ cos (b* 2) ; 


Here a is a constant with 0 < a < 1, small enough to assure that the f,, converge 
uniformly; b is a positive integer, large enough to create rapid oscillation in the 
cosines. We omit computational details, but here are the main points: 


¢ The f,, converge uniformly on R to some function f. 
© All the f, are differentiable (and therefore continuous). 


¢ The limit function f is continuous everywhere, as Proposition 4.18 guaran- 
tees. 


¢ Thanks to rapid oscillation in the summands, the limit function is differen- 
tiable nowhere. 


Figure 4.10 shows the first few f,,, with a = 0.5 and b = 13. 


Exercises 


1. In each part show, that the given sequence { f,, } converges pointwise to the 
given limit f on the given interval [. Is the convergence also uniform? Why 
or why not? 
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(a) fn(x) = + forall n; f(x) =0; I= 
(b) fn(x) = = forall n; f(7) =0; 1 =R 
(c) fr(z) = 22 forall n; f(x) =0; 1 =R. 


2. Consider the sequence { f,, } defined by 


fied = 44 ifx>n, 


0 ife@<n. 


(a) Show that { f,,} converges pointwise on R to the zero function f(x) = 
0. Is the convergence also uniform? 


(b) Show that {f,} converges uniformly on J = [—1000, 1000] to the 
zero function f(x) = 0. 


3. In the situation and notation of Proposition 4.17, page 208, show that if 
fn — f and gn, — g, both pointwise on J, then f, + gn — f +g pointwise 
on J. Show also that the same result holds if “pointwise” is replaced with 
“uniformly.” 


4. Consider the situation and notation of Proposition 4.17, page 208. 


(a) Show that if f, — f and g, — g, both pointwise on J, then fgn — 
fg pointwise on I. 


(b) Show that if f, — f and g, — g, both uniformly on J, then { frgn} 
need not converge uniformly J to fg. (Hint: Try fn(2) = gn(a) = 
(a + 1/n) and I = R.) 


5. This problem is about Example 6, page 209; consider the functions f,, de- 
fined there. 


(a) Let n be any positive integer. Show that f,(x) = |a|'t!/” is dif- 
ferentiable at x = 0, with f’(0) = 0. (Hint: Look at both left- and 
right-hand limits of the difference quotient for f/ (0).) 


(b) Let n be any positive integer. Show that | f(x) — f,(x)| < + for all 
€ [-1,1]. (Hint: It is enough to show, using elementary calculus, 
that 2 — 21t1/" < 1/n for x in [0, 1].) 
(c) Use the inequality in the preceding part to show that f, — f uni- 
formly on [—1, 1]. 


6. Prove (informally) or disprove (with a counterexample) the following state- 
ments about limits of sequences of numbers. 


4.4. Sequences of Functions 


(a) If x, € Z for all n and x,, — L, then L € Z, too. 
(b) If x, ¢ Q for all n and x, > L, then L ¢ Q, too. 
(c) If tp, € [—-3, 42] forall n and x, — L, then L € [—3, 42], too. 


. Let f : R — R be continuous on R, and define the sequence {f,,} by 


fn(z) = f(a+1/n) forn EN. 


(a) Show that f,, — f pointwise on R. 


(b) Give examples to show that { f,,} may or may not converge uniformly 
to fon R. 


. Let {f,,} be a sequence of functions with f,, — f pointwise on an interval 


I. Consider the new sequence {g,,} defined by gn(x) = sin ( fn (a) ) and 
the function g(x) = sin ( f(a) ). Show that g, — g pointwise on I. 


. Let {f,,} be a sequence of functions with f,, — f uniformly on an interval 


I, and let h : R — R be uniformly continuous on R. Consider the new 
sequence {g,,} defined by g,(x) = h( f(a) ) and the function g(x) = 
h( f(a) ). Show that g, — g uniformly on J. 


. In the situation of Proposition 4.17, page 208, suppose that f, — f and 


Jn — g, both pointwise on I. Suppose that for all « € I we have g(x) 4 0 
and g,(x) 4 0 for all n. Show that f,/gn — f/g pointwise on I. 


. Let { fr } a sequence of real-valued functions defined on an interval I. Show 


that { f,,} converges pointwise to a function f defined on I if and only if, 
for each x € I, the sequence { f,,(~)} is Cauchy. (Such a sequence is said 
to satisfy a Cauchy criterion.) 
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Integrals 


5.1. The Riemann Integral: Definition 
and Examples 


Familiar Ideas Revisited 


Expressions and calculations like 


: a ° 8 124 
[ soa. i sinzdz, and [ fa= | = — 
0 0 1 3 ah 3 


are familiar from elementary calculus. In this chapter we interpret the integral, de- 
fine it rigorously, and explore some of its properties—including the fundamental 
theorem of calculus, which justifies calculations like the one above. 


Notes on notation. If f(x) = x, then the expressions 


[ te), [ io. [ eae, and ie 


look a little different, but all four mean the same thing. The first two forms use dif- 
ferent variable names, but these choices are arbitrary—all four expressions have 
the same numerical value. For simplicity and economy, we’ ll often drop the vari- 
able name entirely, and use the last form. We’ ll use the other forms when we want 
to emphasize a variable name or, as in the third form, when no specific function 
name is given. 


Riemann’s and other integrals. In this book (and in elementary calculus) “‘inte- 
gral” means “Riemann integral,” after the German mathematician G. F. B. Rie- 
mann (1826-1866), who first defined integrability rigorously. Riemann’s mathe- 
matical accomplishments, despite his short life, ranged across the discipline. One 
of his conjectures in number theory, now known as the Riemann hypothesis, re- 
mains after 150 years among mathematics’ most important unsolved problems. 
And not for lack of trying: A $1 million prize for its solution, offered by the Clay 
Mathematics Institute, lies unclaimed. 
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Draw your own pictures to 
illustrate the possibilities. 


Is the last integral slightly 
positive or slightly negative? 
Why? 


Draw your own pictures for the 
first two integrals. 
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The phrase “Riemann integral” honors a person, but it also distinguishes one 
particular approach to integration from several others, each with its own features 
and (depending on one’s viewpoint) bugs. Among important alternatives to Rie- 
mann’s integral are the Lebesgue and the Henstock integrals, developed around 
1900 and 1950, respectively. These integrals “agree with” the Riemann integral 
for all the standard functions of calculus, but they are more general in the sense 
that they handle larger classes of functions. More advanced courses in analysis 
treat such integrals carefully, but we will stick with Riemann’s version. 


Areas and integrals. The most familiar view of integrals from elementary calcu- 
lus involves area: If f(x) > 0 for x € [a,b], then es f(a) dx measures the area 
above the x-axis, below the curve y = f(a), and between the vertical lines x = a 
and x = b. If f(x) < 0 for some inputs, then area below the z-axis is involved, 
and counts as negative. 

Thinking of integrals as areas will often be helpful for us, too. This view 
suggests—correctly—that 


iC 3 15 3.14 
/ 3dx = 18, i (1+<2)dr= cy and ‘f cos(x) dx = 0, 
1 0 0 


assuming, as we do for now, that all three integrals exist. But extra care will be 
needed for other functions, whose graphs may be ragged or broken, not smooth. 


Defining the Integral 


What do statements like 


3 wT b 
: 2edz = 9, | sinzdx=2, and i f=-7 
0 0 a 


really mean? For the first integral, the integral-as-area view is enough—9 is the 
only reasonable answer. For the second integral, the area view suggests 2 is possi- 
ble, but is it exact? Why not 2.034? Or 1.957? For the third integral, the negative 
result and elementary calculus intuition suggest that f(a) is in some sense more 
negative than positive for x in the interval [a, b]. But in what sense is the answer 
exactly —7? 

Resolving all these questions asks a lot of the integral, and so it is no surprise 
that the recipe is complicated. Here, first, we assemble the ingredients: 


° Integrand and interval: The definition requires an integrand—a real-valued 
function f : [a,b] — R defined for all inputs x in a closed and bounded 
interval [a, b]. Note, in particular, that although integral equations like 


[ = 1 and ie Dag 2 
—_—- ar = — 6 
1 @ o Va 
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are sometimes seen in elementary calculus, the integrals in question are not That's why we call them 
of the type considered here. “improper.” 


* Partitions, subintervals, and norms: A partition P of an interval [a, b] is 
any choice of n + 1 points {x9,71,...,@,}, arranged in increasing order 
from a to b: 


A=% <4, <%2 << +++ < Bn_1 < Xn =D. 


We'll often just write P = {xo,271,%2,...,%n} to emphasize that P is a 
set; the smallest-to-biggest ordering is understood. 


The object of choosing P is to chop [a, b] into n smaller closed intervals, “Partition” is the polite term. 
meeting only at their endpoints: 


[a, b] = [xo, 21] U [1, G2] U-+- U [ei-1, Bi] U- ++ U [2n-1, En]. 


We'll call [a;-1, x;] the i-th subinterval; its length is the positive number 
Ax; = 2; — t;-1. If the points of P happen to be regularly spaced, then 
the partition is called regular, and Ax; = (b — a)/n for all 7. For any 
partition, regular or not, we have 


Ag, + Arg +-:-+ Arty = 2%n — 2X9 = b-a. 


The norm of a partition P, denoted ||P||, is the largest of the Ax;. Requir- Also known as the mesh. 
ing that ||P|| < 4, as we will in the definition, ensures that every subinterval 
is narrow. 


* Sampling points: Given any partition P = {20,271,%2,...,Un} of an in- 
terval [a, b], we can choose sampling points 


81 € [%o,%1], $2 € [21,22], ---, Sn © [Un-1, Ln], 


one from each subinterval of P, to form a full set S = {s1, 52,..., Sn} of 
sampling points from P. 


There are many, many ways to choose a sample S from any given parti- 
tion P. We could choose each subinterval’s left endpoint, its right endpoint, 
or its midpoint, or any combination. Or we could throw an imaginary dart 
into each subinterval to choose a sampling point at random. A partition 
endpoint, say x42, might even serve twice as a sample point, representing 
both [v41, X42] and [xa2, X43]. 


© Riemann sums: Given all the data (a function f, an interval [a, b], a parti- 
tion P, and samples S) just described, a Riemann sum is an expression of 
the form 


So f(s) Avi = f(s1)Aa, + f(se)Ave +--+ + f(Sn)Atn. 
i=1 
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Even the shorthand is slightly 
cumbersome. 


We don’t know “officially” yet 
that the integral exists. But it 
does. 


Chosen at random by 
Mathematica. 
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a sy xy $2 X83 x3 Sq X45 b 


Figure 5.1. A generic Riemann sum. 


As a shorthand we’ll sometimes use the notation RS(f,P,S) to denote a 
Riemann sum with all these data. When all the data are given explicitly, 
RS(f,P,S) reduces to a number. Under good conditions—as specified 
in the coming definition—we expect this number to approximate an inte- 
gral. 


A Riemann sum is a sort of “representative sample” of the function f on 
[a, b]: We chop [a, 6] into smaller pieces, evaluate f at one point of each 
piece, and add up the results, “weighting” each value f(s;) with the “size” 
of the subinterval it represents. Figure 5.1 illustrates the familiar idea. 


EXAMPLE 1. Explore some Riemann sums for fe sin(a) da. 


SOLUTION. The integral involves the integrand f(a) = sin x and the interval 
[0,2]. The simplest and least interesting partition of [0, 2] involves no chopping: 
P = {xo, v1} = {0,2} has just one subinterval, and ||P|| = 2. Here we need 
just one sampling point, say, s; = 1.3657, and the corresponding Riemann sum 
is simply 

f(s1)Av1 = sin(1.3657) - 2 = 1.9581. 


With so little work or thought invested, we can’t expect much return from the 
answer. 
Let’s work (just) a little harder. With 
P = {0,0.8,1.0,1.7,2} and S = {0,0.81, 1.69, 1.73}, 


we have ||P || = 0.8 and the Riemann sum RS(f,P, S) becomes 
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S— f(si)Aai = sin(0) - 0.8 + sin(0.81) - 0.2 + sin(1.69) - 0.7 + sin(1.73) - 0.3 
= 1.57. 


With the same partition, but new samples S = {0.8,0.8, 1.7, 1.7}, the Riemann 
sum becomes 


sin(0.8) - 0.8 + sin(0.8) - 0.2 + sin(1.7) - 0.7 + sin(1.7) - 0.3 © 2.14, 


which isn’t all that close to earlier estimates. 
Now let’s get serious. With a regular, 20-piece partition (so ||P|| = 0.1) and 
midpoint samples, we have 


P = {0,0.1,0.2,...,1.9,2} and S = {0.05,0.15,...,1.95}, 
and the Riemann sum works out to 
sin(0.05) - 0.1 + sin(0.15) - 0.1 + ---+ sin(1.95) - 0.1 = 1.42. 


With right endpoint samples S = {0.01,0.02,..., 2.0} from the same partition 
the result is not much different; Mathematica gives about 1.421. These numbers, 
and all the extra work, deserve more credibility. ©) 


We are ready at last for the formal definition. 


Definition 5.1. Let [a,b] be a closed interval, f : [a,b] — Ra function, and I a 


real number. Then f is integrable on |a, b], and we write ih f =I, if for every 
given € > 0 there exists 6 > 0 such that 


IRS(f,P,S) —I|<e 
whenever P = {Xo0,1,...,2y} is a partition of [a, b] with ||P|| < d and S = 


{s1,--.,5n} is any set of samples from P. 


Illustrating the Definition 
The definition is undeniably complicated, but simple examples will help us un- 


pack it, and reveal some pleasant surprises. 


EXAMPLE 2. Let f(a) = 3 for all x and [a,b] = [1,7]. Thinking of area 
suggests that : f = 18. Does the definition agree? Can the idea be generalized? 
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Thanks, Mathematica. 


Rightly so. The exact value, as 
we will be able to prove soon, is 
1-—cos 2 = 1.41615. 
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SOLUTION. Yes, and yes. 
If P is any partition of [1,7] and S is any set of sampling points, then 


n 


i=1 i=1 


i=l 


which is exactly the desired answer. Thus, for a given € > 0 we can choose any 
positive 6, say 6 = 42, and the definition is satisfied. © 


There’s nothing special, of course, about the data in the preceding example. 


Proposition 5.2. If f(a) = k is any constant function and (a, b| any interval, then 
f is integrable on (a, b|, and 


[ kav =k: (0-0) 


Nonnegative integrands, nonnegative integrals. Following is another basic prop- 
erty of integrals that follows directly from the definition. 


Proposition 5.3. Suppose f(x) > 0 for all x € |a, b). yf f exists, (a f>0. 


Proof: The idea is simple: since all Riemann sums for J = i f are nonnegative, 
J itself must be nonnegative, too. The formal proof takes some care. 

Let « > 0 be given. Choose 6 > 0 that works for this € in the sense of 
Definition 5.1. If we choose any Riemann sum RS (f,P,S) with ||P|| < 6, then 
we must have |RS(f,P,S) —I| < «. Because f(x) > 0 for all x € [a, 8], it 
is clear that RS(f,P,S) > 0, too, and so we must have J > —e. Since this 
inequality holds for all positive «, we must have I > 0. 


Stranger integrands. In Proposition 5.3 we assumed that the integral exists. De- 
ciding which integrals exist takes some work, and we start slowly. The following 
integrand is discontinuous, but only at one point. Does it matter? 


EXAMPLE 3. Let 


2 
Does | f exist? 
0 
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SOLUTION. Thinking about area suggests Te f = 0; the graph of f seems too 
skinny to bound appreciable area above the x-axis. 

Proving this is not difficult. Let « > 0 be given, set 6 = €/2, and let P = 
{£0,21,..-,2n} be any partition with ||P|| < 6. We need to show that if S = 
{s51, 52,..., Sn} is any set of sampling points for P, then the associated Riemann 
sum satisfies 

—e< f(si)Agi +--+ + f(Sn)Aty < €. 
Since all of the f(s;) Aa; are nonnegative, it is enough to prove the right-hand 
inequality. 

Now f(s;) = O unless s; = 1, in which case f(s;) = 1. Thus, each summand 
f(s:)Az; is either 0 or Ax;. Because the “offender point” 2 = 1 can lie in at 
most two subintervals, say [w;~1, v;| and [#;, 7;41], our Riemann sum can have at 
most two nonzero summands. Adding everything up gives 

So f(siAws = f(si)Aai + f (sip) Atins 
i=1 


€ € 
S$ Aaj + Artin <5 +5 =6 


as desired. ©) 


Example 3 shows that one discontinuity is forgivable in an integral. What 
about many? 


EXAMPLE 4. Recall our old friend 


@ {1 trea 
Vo ite dQ. 


1 
Does | g exist? 

0 
SOLUTION. No. The problem, roughly speaking, is that for every partition P = 
{xo,21,%2,-.--,;%n} of [0,1], no matter how “fine,” there are both rational and 
irrational numbers inside every subinterval [x;_1, x;]. If all of the sample points 


8; are chosen to be rational, then g(s;) = 1 for all 2, and the corresponding 
Riemann sum works out to 


S(f,P.8)=Srals jAg; = ye Az; = 1. 
i=l 


If, instead, all the s; are irrational, then we get 


S({,P.S) = Yale) aes ian 0. 
w=1 
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Draw your own. 


The left-hand inequality is 
obvious. 


Very likely, all summands are 
zero. 


Can you guess anything from 
the graph of g? 
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Why the denominator of ten? 
Read on. 
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1 2 3 4 


Figure 5.2. A discontinuous integrand. 


This situation—widely varying Riemann sums even for “fine” partitions—is in- 
compatible with Definition 5.1. o 


The last example involves another type of discontinuity. 


EXAMPLE 5. Figure 5.2 shows the function 


_ fs ifxe [1,3], 
=f? if x ¢ [1,3]: 


a region of interest is shaded. Show that f is integrable on [0, 4], with i. f =10. 


SOLUTION. We know already from the discussion in Example 2 that f is inte- 
grable on [1,3], and that [. : f = 10. We need now to show that f has the same 
integral over a larger interval. This should seem reasonable if we think in terms 
of area; the (slightly laborious) proof makes this intuition precise. 

Suppose, then, that « > 0 is given. We’ll show that 6 = €/10 works in the 
sense of Definition 5.1. To see this, let P = {xo, ..., 2} be any partition of 
(0, 4] with ||P|| < 6, let S = {s1,..., 5,} be any choice of sampling points 
within P, and let RS (f,P,S) be the corresponding Riemann sum. To prove our 
claim, we need to show that 


10—e€< RS(f,P,S) < 10+.€. 


We’ ll tackle the left-hand inequality and leave the right-hand one as an exercise. 
Because P is a partition of [0,4], we can choose among the partition points 
the first x; with x; > 1 and the last x; with x; < 3. If, say, these points are x42 


5.1. The Riemann Integral: Definition and Examples 


and 2237, then we have 
me <4, <1 << aan < 43 < +++ < Logg < L037 <3 < Logg <.... 


This implies that 1 < s; < 3 fori = 43,44,...,237, and so f(s;) = 5 for all 
these i. Moreover, because ||P || < 6, we know that 


C42 <1+6 and 3-6 < £937, 
and so £937 — ®42 > 2 — 26. Now we can calculate: 


RS(f,P,S) = f(s1)Aai +--+ + f(sn)Atn 
> f(s43)Aarag + +--+ f (S237) Ax237 
= 5Agrag +--+ 5Ax37 
= 5 (#237 — £42) 
5 (200) = 10 10s dd 


as we aimed to show. ©) 


Lessons from the examples. The examples illustrate one pleasant and one less 
pleasant property of integrals. The good news is that the Riemann integral is rel- 
atively forgiving of minor misbehavior, such as occasional discontinuities, in an 
integrand. Less convenient is the fact that proofs and calculations using the def- 
inition, even for quite simple integrands, can be messy and technical. The moral 
is that we’d like to have both simpler tests for integrability and more efficient 
methods of calculating integrals. We’ll find some in the following sections. 


Which Functions are Riemann Integrable? 


Deciding precisely which functions have integrals and which don’t turns out to be 
a very hard problem; we won’t tackle it in full generality. Partial answers, how- 
ever, are interesting and accessible—and sometimes surprising. We will show, 
for instance, that every continuous function (and hence every differentiable func- 
tion) on an interval [a, b] is indeed integrable. We’ll see, too, that every monotone 
function f : [a,b] — R is also integrable—even though such an f might have 
infinitely many discontinuities on [a, }]. 

We will study the question of integrability more carefully in Section 5.3. 
Meanwhile, here’s one useful criterion. 


Proposition 5.4. If f is integrable on |a, b], then f is bounded on |a, b]. 
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all terms are nonnegative 


the A x; “collapse” 


As we just saw in Example 5. 


Try drawing a 
monotone-but-nasty function. 
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About the proof. We sketch the proof of a special case, leaving generalities 
to the exercises. Suppose, say, that io = 42. Now let « = 1. Choose 
6 > O that works for this « in the sense described in Definition 5.1, and let 
P = {x0,%1,%2,... ,X,} be any partition of [1,3] with ||P|| < 6. If we can 
show that f is bounded on each of the n subintervals [a;_1, 7], we can conclude 
that f is bounded on ail of [1, 3]. 

tis the only variable; all the x; To see why f is bounded on, say, |2¢, x7], consider the expression 


are constants. 
S(t) = f(a1)Aai+---+f(ae)Are t+ f(t)Av7t+ f(ag)Avgt+---+ f(an)Arn. 


Here is the key idea: For each t € [6,27], S(t) is a Riemann sum, based on the 
partition P, for the integral is f, and so Definition 5.1 guarantees that 


41 < S(t) < 48, 


which means that S(t) is bounded for t € [xg, x7]. This implies, as desired, that 
f(t) is bounded for t € [x6, x7]. 


Exercises 


1. Let f(x) = x? and consider the integral I = [, : f, (We know from elemen- 
tary calculus—and we’ll prove soon—that I = 9.) 


(a) Use the partition P = {0,0.8, 1.0, 1.7,2, 2.9, 3} and the sample set 
S = {0,1, V2, V3, 2,3} to evaluate the Riemann sum RS(f,P, S). 
What is ||P ||? 

(b) Using the same partition P = {0,0.8, 1.0, 1.7, 2, 2.9,3}, choose an- 
other sample set S for which the RS(f, P, S) is as small as possible. 
How small can RS(f, P, S) be? 

(c) Using the same partition P = {0,0.8, 1.0, 1.7,2,2.9,3}, choose a 
sample set S for which RS(f, P, S) approximates J (the exact value is 
nine) as badly as possible. How much error can RS(f, P, S) commit? 

(d) Using the (one-piece) partition P = {0,3}, choose a (one-member) 
sample set S for which RS(f,P, S) = 9. 


2. Let f(x) = «x and consider the integral J = oie f, (We know from ele- 
mentary calculus—and we’ll prove soon—that J = 8. See also Problem 12 
below.) 


(a) Using the partition P = {0,1,2,3,4}, find a sample set S for which 
RS(f,P,S) = 8. 


(b) Using the partition P = {0,1,2,3,4}, find a sample set S for which 
RS(f,P,S) = 9. 
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(c) For P = {0,4} find a sample set S so that RS(f,P,S) = 0. 

(d) For P = {0,4} find a sample set S so that RS(f,P,S) = 8. 

(e) Is there a two-piece partition P = {0,21,4} and a sample set S for 
which RS(f,P,S) = 0? Why or why not? 


(f) Describe a 100-piece partition P = {0,21,...,%99,4} and a sample 
set S for which RS(f,P,S) < 0.01. 


3. Suppose that f is integrable on [0,2] and that f(a) > 3 for all x € [0,2]. 
Show that fe f => 6. (Mimic the proof of Proposition 5.3, page 220.) 


4. Use Definition 5.1 to prove Proposition 5.2: If f(a) = & is a constant 
function, then f is integrable and ie kdza =k-(b-a). 


5. Let f(z) = 0 for « 4 0 and f(0) = 42. We explore the proof that f is 
integrable on [0, 1] and fo f=0. 


(a) Let P = {0, 0.001, 0.002, ..., 1} be a 1000-piece partition of [0, 1], 
and let S = {s1, 52, ..., $1000} be any corresponding set of samples. 
What are the possible values of RS(f, P, S)? 

(b) Let P = {0,21,...,%n—1, 1} be any n-piece partition of [0,1], and 
P = {s1,..-,8n} any set of samples. Explain why we must have 
either RS(f,P,S) = 0 or RS(f,P,S) = 4221. 

(c) Let « > 0 be given and set 6 = €/42. Show that if ||P || < 6 and S = 
{s1,..., Sn} is any set of samples from P, then0 < RS(f,P,S) <. 
Conclude that i f=0. 


6. Let f(x) = Ofora 4 Oand f(1) = 42. Proveas follows that f is integrable 
on [0,2] and he f=0. 


(a) Let P = {0, 0.001, 0.002, ..., 1.999, 2} be a 2000-piece partition 
of [0,1], and S = {s1, 52, ..., $2000} any corresponding set of sam- 
ples. What are the three possible values of RS(f,P,S)? 

(b) Let « > 0 be given and set 6 = €/84. Show that if ||P || < 6 and S = 
{s1,..., Sn} is any set of samples from P, then0 < RS(f,P,S) <e. 
Conclude that fe f =0. 


7. Suppose g is integrable on [a, b], with dk g = 0. Suppose 0 < f(x) < g(x) 
for all x. Show that f is also integrable, with i f=0. 


8. What is the converse of Proposition 5.3? Is it true? If so, why? If not, give 
a counterexample. 
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9. Prove that the value of J in Definition 5.1 is unique. (In other words, if both 
I; and I satisfy the definition, then J; = J.) 


10. Suppose that J = rp f exists. Show that the following “Cauchy condi- 
tion” holds: for all « > O there exists 6 > O such that if P; and P2 are 
two partitions of [a, b] with ||Pi|| < 6 and ||P2|| < 6, and S; and S2 are 
corresponding sample sets, then |RS(f,?1,51) — RS(f, Pe, S2)| <. 


11. Suppose that f is continuous on [a,b]. We will show later that f is also 
integrable on [a, b]; here are some steps in that direction. 


(a) Explain why the following condition holds: for any given € > 0, there 
exists some 6 > 0 such that if s and ¢ are in [0,1] and |s —t| < 4, 
then | f(s) — f(t)| <e. 

(b) Let € and 6 be as in the preceding part, let P = {20, 11, 2, ..., Un} 
be any partition with ||P|| < 6, and let S = {51, 52, ..., Sn} and 
T = {t1, te, ..., tr} be two different sample sets for P. Show that 


IRS(f, Bs S) = RSP Ps T)| < e(b ane a). 
12. Show as follows that f(x) = x is integrable on [0, 1], and that fo f =1/2. 


(a) Let P = {x0, 21, U2, ..., 2n} be any partition of [0, 1] (so zo = 0 
and x, = 1). Let M = {my, mo, ..., mn} be the sample set whose 
elements are the midpoints of their respective subdivisions. (For ex- 
ample, m7 = (#7 + x6) /2.) Show that RS(f,P,M) = 1/2. 

(b) Let « > 0 be given. Show that 6 = e€ works in Definition 5.1 to 
show that J = fo f = 1/2. (Use the preceding fact and part (b) of 
Problem 11.) 


13. Generalize ideas in Problem 12 to show that f(z) = x is integrable on 
every interval {a, b], and that if x dx = (b? — a?)/2. 


14. Let f(x) = 1/xif « # 0 and f(0) = 0. Proposition 5.4 says that f is 
not integrable on [0, 1]. To illustrate this, show that, for any partition P = 
{0, 1, ..., @n—1, 1}, there is some sample set P = {81, 52,..., Sn} 
such that RS(f, P2,S2) > 1000. 


5.2 Properties of the Integral 


Basic properties of the definite integral are familiar from elementary calculus— 
where they may be taken for granted. Using the formal definition, we can readily 
give rigorous proofs. 


5.2. Properties of the Integral 


Algebra with Integrals 


It is standard operating procedure in calculus to break complicated integrands 
apart, integrate the pieces separately, and then reassemble the results. For exam- 
ple, we can write 


1 1 1 
| On ee es av =7 | : ax 3 f aoe 
9 \e@tl «+1 9 +1 9 22+1 


T 
= 7|n2—3-— 
71n 37> 


assuming (for now) that all three integrals in question exist. In elementary calcu- 
lus we may skip over some theoretical niceties, but now we can do things right. 
The next theorem guarantees that integrable functions (like differentiable func- 
tions, continuous functions, and other objects we have studied) “behave well” 
with respect to addition and scalar multiplication. 


Theorem 5.5. Suppose that functions f and g are integrable on (a, b], and let k 
be any number. Then the new functions kf and f + g are also integrable on (a, bl, 


ith 
“ [unefs and furo= firs fs 


Integration is linear. Before proving the theorem we observe that we can apply 
it repeatedly to handle more complicated linear combinations. If, say, f, g, and 
hare all integrable on [2,5], then the following equation makes good sense—and 


it’s true: 
5 ‘ 5 5 nf? 
3f—7 —h)=3 —7 — h. 
[ (st-t9+ Fn) a3 for-7 fort f 


A fancier way to state these ideas uses the language of linear algebra. The set 
V of integrable functions on |a, b] is a vector space, and integration on [a, }] is a 
linear transformation from V to R (another vector space!). 


Proving Theorem 5.5. We’ll sketch the proof for sums and leave the rest to the 
exercises. For brevity we write I for ie f and I, for f g. 
Let € > 0 be given. Since Jy and I, exist, there are positive numbers 6+ and 


dg such that, for any sampling points s1,..., Sn, 
” € 
P\|l<o; => ;)Aa; —Tz| < =; 
IPI <8; — [Qo fede. - ty] <5 
” € 
|P || < og = S > g(si)Axi i Ig < 3 
i=1 
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Use basic calculus to check the 
last identity, if you like. 
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Watch for the triangle inequality. 
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Now let 6 = min{dy,6,}; we'll show that this 6 “works” in the sense of Def- 
inition 5.1 for the sum function f + g. To do so, consider any partition P = 
{@0,%1,%2,... Gn} with ||P] < 4, and let S = {51,82,... 8,} be any corre- 
sponding choice of sampling points. Now we calculate: 


n 


S° (f(si) + g(si)) May — (Lp + Jy) 


i=1 
=|S— f(s Vie SES eke 


t=1 1=1 


“len s;)Aa, — If] + 


s;)Aa,; — Ig 


This shows what we claimed: f + g is integrable on [a, b], with integral I + Ig. 


Bigger integrands, bigger integrals. Theorem 5.5 has a simple and natural corol- 
lary; we leave the proof to the exercises. 


Corollary 5.6. Let f and g be integrable functions on |a, }}. 


b b 
If f(x) < g(x) for all x € [a,b], then / f< ‘| g 


Combining ideas from Example 3, page 220, and Theorem 5.5 produces some 
possible surprises. 


EXAMPLE 1. Suppose 
f)=1, f2)=2, f(3)=3, ..., f(000) = 1000 
and f(a) = 0 for other x. Does i f exist? 


SOLUTION. Yes, and i aed = 0. Example 3, page 220, slightly modified, 


shows that functions like f17 : [0, - 1000] — R, given by 
17 ifv=17, 
fir(a) = is otherwise 


1000 


are all integrable on [0, 1000], and that if fiz = 0. Our given f satisfies 


f= fit fot--++ fiooo, 
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and so Theorem 5.5 gives 


1000 1000 1000 1000 
i) f=) fit f fatoot | frooo 
0 0 0 0 


=0+0+---+0=0, 


as desired. ©) 


EXAMPLE 2. Suppose f is integrable on [a,b], with i f = 42, and suppose 
g(x) = f(a) for all but finitely many z in [a, 6]. What can be said about iP g? 


SOLUTION. In this case, iC g = 42, too. To see why, consider the difference 
function g — f. Since g(a) — f(x) = 0 for all but finitely many «, the method of 


Example | shows that 9 — f) = 0. By Theorem 5.5, 


fo-frre-n= [r+ fe-n-ar0-0. 


Here is another way to express the result: If Bs f = 1, then altering f(a) at 
finitely many points in [a, b] leaves the integral unchanged. © 


Joinery 


Another way to build new integrals from old is to stick “pieces” of various func- 
tions together. Interestingly, the pieces need not fit together continuously. In this 
sense, the integral is more forgiving than the derivative, which requires smoother 
joinery. 


EXAMPLE 3. Figure 5.3 shows a function f formed—clumsily—from three sep- 
arate pieces. What can be said about te f? 


SOLUTION. Despite the breaks at 2 = 1 and x = 3, the resulting function is 
integrable on (0, 4], and can be integrated piece by piece: 


[sq fore firs [r-aszsane 


Justifying such calculations rigorously takes a little effort. The main idea is in the 
next lemma. ©) 
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Draw your own picture of a 
typical fand f 


The e/2 will come in handy later. 
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1 2 3 


cal 


Figure 5.3. Clumsy joinery. 


Lemma 5.7. Suppose that f : [a,b] — R is integrable, with ft f = I. Assume 
c > b, so [a,b] C [a,c], and consider the “extended” function f : [a,c] > R 
defined by 

f(x) ifxe [a,b], 
(x) = 

0 ifa > b. 


Then f is integrable on [a,c], and 


The live issue, by the way, is integrability of f. Once we know that, it is no 
surprise that extending f in the way described doesn’t change the value of the 
integral J. The proof—like others that involve the definition of the integral—is a 
bit messy, but it is worth sticking with as an example of its type. The main idea is 


that Riemann sums for iM f can’t differ by much from Riemann sums for i" ‘e 


Proof: Let « > 0 be given. We need to find 6 > 0 so that if P is any partition of 
[a, c] with ||P || < 6, and S is any choice of sampling points for P, then 


<e€. 


i=1 


; Bog Ais b 
Let’s use what we know about the original integral i f to produce such a 6. 


: : b : 
First, since Ai f exists, we can choose 67 > 0 so that 


m 


i=l 


Ze 
2 
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holds whenever P’ = {20,2%1,... Um} is a partition of [a,b] with ||P’|| < On, 
and the sampling set S’ = {s1, 2,... 5} is drawn from P’. Second, since ‘ie f 
exists, we know f must be bounded on [a, b], so we have |f(x)| < M for some 
positive number M and all x € [a, }}. 

Our goal is now visible in the distance. We’ll show that the positive number 


6 =min { 5;, 7} 


has the desired properties. To this end, let P = {21,x2,... %»} be any partition 
of [a,c] with ||P|| < 4, let S be any selection of sampling points from P, and 
consider the corresponding Riemann sum 


oi s;)Aa; = f(si)Aa, + f(s2)Ave +--+ + f(Sn)Atn. 


We need to show that S lies within ¢€ of J. To do so, let x;, be the /argest member 
of P that does not exceed 0; that is, 7;, < b < 2,41. Now for 7 > i9 + 1 we 
know f(s;) = 0, while for i < ip we have f(s;) = f(s;). With these facts in 
mind, we can rewrite S: 


S = f(si)Av, + f(s2)Are +--+ + f (Sig Atig + F(Sin +1) Aig 41- 


F ; : b Pea 
This form reveals that S' is almost a Riemann sum for le f. Indeed, the similar 
sum 


S’ = f(si)Ari + f(s2)Ave +--+ + f (sig Arig + f(b) (b— rig) 


: : b : Oe 
is a Riemann sum for y f, corresponding to the new partition P’ = {a,21, 
LQ, +++, Lig, b}; note that ||P’|| < 6. (If x;, = 6 then the last summand in S’ 
vanishes. ) 


To finish the proof, we observe that, by our hypothesis, 


|S’ -—I|< a oo 


Also, S and S’ differ by (at most) two terms, each of which is small: 
|S — S"| = |f (Sin +1) Avion: — f(b) (6 — 2ig)| 
< | F(Sio+1) Avig41| + |F(0) (b- aio) 
€ 
Mé6+ M6 < 2M— = -. 
<Md+ < Vi 


Combining this with inequality (*«*) gives 


IS—I|<|S-S|+|9'-1]<5+5=6 


as desired. 
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Here a= Xo and b= Xm. 
So says Proposition 5.4, 
page 5.4. 


Strange as it may look at first 
glance. 
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Lemma 5.7 lets us prove the theorem we really want. 


Theorem 5.8. Let f : [a,b] — R be a function, and let c be a number between a 
and b. If both es fand f f exist, then r f exists, too, and 


[s-farfs 


Proof: Define f; and f2 on [a, b] by 


nce = {0 ifa<a<e, rte) ={! ifa<a<e, 


0 ifc<a2<b; f(z) ife<a<b. 


Note that f(x) = fi(x) + fe(x) for all x € [a, b]. Now Lemma 5.7 says that both 
fi and fo are integrable on [a, b], and that 


fa- fs and fa-fs 


To finish up, we add up integrals, as Theorem 5.5 permits: 
b b b c b 
[tof asf n-frr fs 


Theorem 5.8 needs an important caveat: The add-up-the-pieces strategy is 
guaranteed to work only if we know in advance that the “smaller” integrals 
i f and ie f exist. (We do know this in Example 3, in which f is constant 
on each smaller subinterval.) In particular, we have not proved the tempting (and 
true) fact that if f is integrable on [a, b], then it is also integrable on any smaller 
interval [a,c]. We will prove this in the next section, where we consider more 
carefully the question of which functions are integrable. 


as claimed. 


Antiderivatives and Integrals: Toward a Fundamental 
Theorem 


Elementary calculus experience suggests, almost irresistibly, that integrals and 
antiderivatives are closely connected. So they are, and the following result, a 
junior-grade version of the familiar fundamental theorem of calculus, begins the 
connection. 

But there’s more to say: a stronger version (Theorem 5.19, page 250) appears 
in the next section. 


5.2. Properties of the Integral 


Theorem 5.9 (Fundamental theorem of calculus, version 0). Let f : [a,b] — R 
be a function, and suppose 


(i) f is integrable on (a, bj; 


(ii) there is a function F : [a,b] > R, continuous on {a, b| and differentiable on 


(a,b), such that F’(x) = f(x) for all x € (a,b). 
Then [° f = F(b) — F(a). 


Proof: By our hypotheses, both of the numbers J = iE f and F(b) — F(a) exist; 
we need to show they’re equal. We’ll do so by proving that 


|F(b) — F(a) -I| <e 


for every positive €, no matter how small. 
Let « > 0 be given. Because J satisfies Definition 5.1, we can choose 6 > 0 
as described there. Now let 


P = {xo = 4,21, %2,... In = b} 


be any partition with ||P || < 6. 

The proof uses a clever idea: If we choose the sampling points s1, 52,..., 5n 
just right, the associated Riemann sum adds up, after some fancy telescoping, to 
none other than F'(b) — F(a). Once this is done, Definition 5.1 guarantees that 
F'(b) — F(a), like every Riemann sum associated with P, lies within ¢ of I. Since 
¢ is arbitrary, we must have F'(b) — F(a) = I, as claimed. 

To choose these special sampling points, we apply the mean value theorem 
(MVT) to our antiderivative function F’, once on each of the n partition subin- 
tervals. Hypothesis (ii) assures that the MVT applies. For each subinterval 
[x;-1, v;], the MVT says that for some s; in (x;_1, x;), 


F(2x;) = F(aj-1) = F'(s;) (aj i Li-1) = f(si)Aa;. 


Adding all n of these results gives the desired Riemann sum: 


n 


> f(s Ax = 2 F(24) — F(ai-1) 


i=l 


= F(21) F (x0) t F (22) F (a1) t -++ F(an) — F(an—1) 
= F(a2,) — F(xo) = F(b) — F(a). 


This completes the proof. 
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Finding such a P is easy. One 
is to chop [a,b] into n equal 
pieces, where [a,b]/n< 6. 


Watch the summands collapse. 
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Ask Mathematica or Maple to 
antidifferentiate these functions; 
notice the strange ingredients in 
the answer. 


Definition 5.1, page 219 


We will see later that f, being 
continuous, is integrable. 
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Good news—and some cautions. Theorem 5.9 should look familiar. It allows 
many standard integral calculations of elementary calculus, like this one: 


2 3]> 58 18 124 
2d = ae =o Cr oe Trl 
/ anree eeheys Bhs ae oe 


Avoiding all that fuss over partitions, norms, and Riemann sums seems—and is— 
a big advantage in calculating a lot of integrals. But some sticky questions remain: 


* Canwe find an antiderivative? In the preceding calculation, with f(x) = 
x”, it was easy to find (or just to know) that F(x) = 23/3 is a suitable 
antiderivative. For other functions it can be much harder, or even impossi- 
ble, to find antiderivative formulas. For instance, neither of the harmless- 
looking functions 


f(x) =cos(a?) and g(x) = cos(z) In(a) 


has an “elementary antiderivative’”—a function built from standard function 
“elements”: polynomials, trigonometric functions, logarithms, etc. Without 
a suitable antiderivative F’', Theorem 5.9 is useless for calculation. 


° Is f integrable? We assumed in the theorem—and used crucially in the 
proof—that ip f exists. Elementary calculus courses often skirt the ques- 
tion of integrability, perhaps forgivably both because the matter is subtle 
and because the basic functions of elementary calculus turn to be integrable 
on intervals within their domains. 


Theorem 5.9 dodges the tough questions above by simply assuming, as hy- 
potheses, that all is well. Doing so simplifies the proof, but it weakens the theo- 
rem. In the next section, we’ll grapple more seriously with the question of which 
functions are integrable. A key result (which applies to all basic functions of ele- 
mentary calculus) is that every function f continuous on a closed interval a, b] is 
also integrable there. 


Using the Fundamental Theorem 


Among the drawbacks to using the (complicated!) basic definition to prove that 
an integral J exists is the need to know (or guess) a specific numerical value for 
I. This number may not be obvious, but Theorem 5.9 sometimes helps. 


EXAMPLE 4. Does i. (2x + sin(x)) dx exist? If so, how big is it? 
SOLUTION. The function f(x) = 2x + sin(«) has antiderivative F(x) = x? — 


cos(x), and so F'(7) — F(0) = 7? + 2 © 11.87. Theorem 5.9 does not guarantee 
that the integral exists, but it does say that 1? + 2 is the only possible value. © 


5.2. Properties of the Integral 


EXAMPLE 5. Show that i 2x dx = 1; use Theorem 5.9 and Definition 5.1. 


SOLUTION. The easy part is finding a value for the integral. Since the integrand 
f(x) = 2a has antiderivative F(x) = x”, and F(1) — F(0) = 1, Theorem 5.9 
says that one is the only possible value. 

The tricky bit is showing that the integral exists. We have, for now, only 
Definition 5.1 to work with, so we start as usual with a given positive e. Let 
6 = €; we'll use some clever algebra and a nice collapsing sum to show that this 
6 “works.” 

Let P = {20,%1,%2,... %n} be any partition of [0, 1] with ||P|| < 6. We 
claim that, for any samples S = {51, 52,...,S,} drawn from P, 


l-ex< ss f(s) Aa; = > 28;An; <1+e. 


i=1 i=1 
We will prove just the second inequality, leaving the first as an (easy) exercise. 
The key observation is that, since s; < x; for all 2, 


i=1 i=1 


It suffices, therefore, to show that the right-hand sum above can’t exceed 1 + e. 
For this we use an algebraic trick. If we write 


204 = (wy + U1) + (@ — Bi-1) = Bj + By_-1 + Az; 


for each 7, then substitution and a little algebra give 
i=l i=1 


= (a; + Li-1) Ax; + Az,” 


= ‘ 
= 5° («? — 274) + An? = S; + So. 
t=1 t=1 


We will handle S; and S2 separately. A telescoping argument helps with Sj: 
Sy = (21 — 20) + (@3— 21) +--+ (en — @n-1) 
a x = xe =1-0. 


The sum S2 is small. Since Ax; < 6 = € for each i, we get 


= Didnt < Dede) An=elae 
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We could have thought about 
areas, too. 
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Putting our results together gives 
nm n 
S 5 28; Aa; < Se 2a ha, = Sy + So a 1+e, 
i=1 i=1 


which completes the proof. © 


Exercises 
1. This problem is about Corollary 5.6, page 228. 


(a) Consider the function h(x) = g(x) — f(x) on [a, 6]. Why must h be 
integrable? Why is [” h > 02 
(b) Prove Corollary 5.6. 


2. Suppose that f is integrable on [a,b] and that m < f(x) < M for all 
x € [a,b]. Explain carefully why 


b 
m(b-a)< | f<M(b-a). 


3. Suppose f is integrable on [1,7] and —1 < f(x) < 2 forall x € [1,7]. 
Find upper and lower bounds for [. ‘ f. 


4. Use the result of Problem 2 to find upper and lower bounds on each of the 
following integrals; assume (it’s true!) that they exist. 


ee sin(a?) 
(a) (5+ ue ) ae 


200 + 2 
sin” @ 
(b) (s zs ) dx 
“a 100 


200 1 
(c) / ——— 
i009 LOO +sin* x 
5. It is a fact (we will prove it later, but just assume it here) that if f is contin- 
uous on [a, b] then f is also integrable on [a, 6). 


(a) Suppose that f is continuous on [a, b] and that f(a) > 0 for all x € 


(a, b]. Show that a f > 0, too. (Hint: Use the extreme value theorem 
and Corollary 5.6, page 228.) 


(b) Suppose f is continuous on [a, b] and that iE f = 0. Show that f(c) = 
0 for some c € [a, b]. (This fact is sometimes known as the mean value 
theorem for integrals.) 


5.2. Properties of the Integral 


(c) Give an example to show that the result in (b) need not hold if f is 
discontinuous on [a, 0]. 


. Suppose that f is integrable on [a, b]. Show (in the spirit of the proof of 


Theorem 5.5, page 227) that 3f is also integrable and that i 3f =3 ft f: 


. Suppose that both a f and i | f| exist. Prove the “triangle-like” inequality 


ie. < fi 


. Consider the function f defined by f(n) = sinn if n € Zand f(x) = 0 


otherwise. Show that f is integrable on every interval [—a, a], where a > 0. 
What is [", f? 


. Consider the “floor function” f(x) defined by 


f(z) =sup{n EN|n <a}. 
(a) Explain (cite an appropriate theorem) why f is integrable on every 
interval [a, 0]. 
(b) What is f°, f? Why? 
(c) Whatis [", f? Why? 


. We showed in and near Problem 12, page 226, that f(x) = x is integrable 


on [a, bj, and that f vdx = (b? — a?) /2. 


(a) Use the formula above and Theorem 5.5 to find ihe (3a + 7) da. 


(b) A function g has the W-shaped graph formed by connecting the dots 
at (0,2), (1,0), (2,1), (3,0), and (4, 2) in the zy-plane. Explain why 
the integral It g exists and find its value. 


. Assume (it’s true) that the integral in each part following exists. Use Theo- 


rem 5.9 to find the value. 
(a) i. x? dx 
(b) fo sina dr 
(c) f? (C+ Da + Ex? + F(23)) dx 
(d) ie re® da 
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See Proposition 5.4, page 223. 


See Example 3, page 220, and 
Example 4, page 221. 


See Example 4, page 234. 


We did something similar—for 


similar reasons—for sequences. 
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12. Consider the function f defined by f(1/n) = 1 forn € N and f(x) = 0 
otherwise. Assuming that f is integrable (as we’ll see in the next section), 
complete the following proof sketch to show that J = [, i f=0. 

(1) Clearly I > 0, so it is enough to show I < e« for all € > 0. (ii) For given 
€, define f.(x) = 1if0 < x < «, and f.(x) = 0 otherwise. Then f, is 
integrable, and ih fe(x) = e. (iii) We have f(x) < f-(a) for all but finitely 


many 2, so if f< a fe(x) =e. 


13. Suppose that both i f and Lg exist, and that f(a) < g(x) for all but 
finitely many « in [a, b]. Show that te f< ie g. 


5.3 Integrability 


How can we decide whether a given function f : [a,b] — R is integrable on 
[a,b]? The question is obviously important: a useful integral should apply to 
many functions. The question is also difficult: deciding which functions satisfy a 
complicated definition naturally takes some work. 

So far we’ve seen only piecemeal results: 


¢ If f is integrable, then f is bounded. 
¢ A discontinuous function f may or may not be integrable. 


© We’ve also said—but not proved—that every continuous function is inte- 
grable. 


In this section we approach these matters rigorously, and identify some im- 
portant classes of integrable functions. Our main tool, detailed in Theorem 5.14, 
is the box sum, a useful and practically usable criterion for integrability. 


Criteria for Integrability 


Our definition of the Riemann integral is relatively technical, so we expect that 
detecting integrability (or its absence) will be challenging, too. An immediate 
hurdle is the fact the definition requires a “candidate” J for the value of a given 
integral. It turns out that we can skirt this problem with a “Cauchy criterion” for 
Riemann sums. 


Lemma 5.10. Let f[a,b] — R be a bounded function. Suppose that for every 
€ > 0 there exists 6 > 0 such that 


IRS(f,P1, 51) — RS(f, P2, S2)| < € 


5.3. Integrability 


whenever P, and P2 are partitions of |a, b| such that both ||P || < 6 and ||P2|| < 
6, and samples S, and Sz come from P, and P32. Then the integral fp f exists. 


Proof: The idea is to concoct a certain Cauchy sequence { I, } of numbers, whose 
limit J will turn out to be the desired integral. 

To get started, for each n € N we use the hypothesis to choose a positive 
number 6,, such that 


1 
IRS(f,P1,S1) — RS(f, Pe, S2)| < A 


for all Riemann sums based on partitions P; and P2 with both ||P1|| < dp and 
||P2|| < 6n. For technical reasons we choose the 6,, to be decreasing: 5; > 62 > 
03 >.... 

Next, for each n we choose any particular partition P,, with ||Pn|| < on 
and any particular set S,, of samples from P,,. The associated Riemann sum 
RS(f, Pn; Sn) is then a number; let’s call it J, for short. This process produces 
a numerical sequence {J,,}, which is readily shown to be Cauchy—and hence 
converges to some limit J. (The proof that {J,, } is Cauchy uses the fact that the 
dn, decrease; see the exercises.) 

Last, we use Definition 5.1, page 219, to show that I is the sought-after inte- 
gral. For given € > 0, we first choose any positive integer N for which both 

€ 1 € 
|In I <— 3 and WV < 3? 
and we set 6 = dy as chosen above. This 6 does what Definition 5.1 asks. If P 
is any partition with ||P|| < dy, S is any set of samples, and RS(f, P, S) is the 
corresponding Riemann sum, then the triangle inequality and our choices give 


IRS(f,P, 5) — I] < |RS(f,P, 5) — In| + [In — 1| 


Pgeengs 
= ee 2 
= 9 9 ’ 


as Definition 5.1 requires. 


Not quite there. Lemma 5.10 will prove useful, but it has a serious practical 
drawback: showing that the hypothesized inequality holds for all suitable parti- 
tions P and all corresponding sample sets S seems difficult. The box-sum crite- 
rion, which turns out to be equivalent to the hypothesis of Lemma 5.10, will prove 
much easier to work with. 


Integrability and Box Sums 


Figure 5.4 shows (shaded) the lower sum, the upper sum, and the correspond- 
ing box sum—the difference between upper and lower sums—for a well-behaved 
integrand f and a five-piece partition of [a, b}. 
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Every Cauchy sequence has 
one. 


Convince yourself this is 
possible. 


A regular partition works fine. 


There are infinitely many P for 
each 6, and infinitely many S for 
each P. 
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a x) Xy X3 X4 a x) xX, x3 X4 


(a) A lower sum (b) An upper sum 


1 1 1 1 1 
a x, xX X3 X4 b 


(c) A box sum 


Figure 5.4. Lower, upper, and box sums for an integral. 


The picture suggests the appropriate definitions, but some extra care is needed 
for discontinuous integrands. The fine print follows. 


Upper and lower sums. For any given partition P of an interval [a, b], there 
are infinitely many ways of choosing samples S compatible with P, and hence 
infinitely many possible Riemann sums associated to P. To get upper and lower 
bounds on the values of all these Riemann sums, it is helpful to consider upper 
sums and lower sums. 

If the integrand f is continuous on |a, b], as in Figure 5.4, then upper and lower 
sums are ordinary Riemann sums, with sampling points chosen to maximize or 
minimize f over each separate subinterval. (A continuous function has maximum 
and minimum values on each subinterval, by the extreme value theorem.) In this 
case, therefore, upper and lower sums are simply the largest and smallest possible 
Riemann sums for a given partition. 

A discontinuous integrand f, on the other hand, need not attain maximum and 
minimum values on a given subinterval. If f is bounded on [a, b], however, then 
both 


5.3. Integrability 


m, = inf{ f(x) | a € [x;_-1,2;]} and M; =sup{ f(x) | x € [x;_1, 2] } 
exist for each 2, and so the following definitions make sense. 


Definition 5.11. Let f : [a,b] + R be a bounded function, and P = {29,21,..., 
Xn} a partition of [a,b]. With m; and M; as above, we define upper and lower 
sums as follows: 


US (f,P) = 5° MA; = M Ax, + M.A. +--+ + MpAan; 
i=1 


LS(f,P) = Ss mAx;, =m Ar, + meAreg +-++ + mynAry. 


i=l 


Upper and lower sums may not be Riemann sums, but they have useful con- 
nections to Riemann sums and to the integrals they approximate. We collect two 
such facts in the following proposition, using RS, US, and LS to denote Riemann, 
upper, and lower sums. 


Proposition 5.12. Let f : [a,b] > R be a bounded function, and P any partition 
of {a, 6]. 


(i) If S is set of samples from P, then 


LS (f,P) < RS(f,P,S) < US(f,P). 


(ii) If f is integrable, with ah f =I, then 


About proofs. The inequality in (i) follows immediately from the definitions of 
upper and lower sums. Part (ii) is a slightly subtler; see the exercises for both 
parts. 


Clamping down: box sums. Proposition 5.12 suggests that it is a good thing for 
integrability when upper and lower sums are close together. Box sums measure 
this closeness. 


Definition 5.13 (The box sum). Let f : [a,b] — R be a bounded function and 
P = {X0,%1,...£n} a partition of |[a, b]. The associated box sum is the difference 


US (f,P) — RS (f,P) = (Mi — mi )Aa +--+ + (Mn — mn )ACy. 


Box sums have a nice geometric interpretation, as seen already in Figure 5.4(c). 
Figure 5.5(a) shows another box sum for the same integrand as before, but here 


The completeness axiom 
guarantees this. 


They are Riemann sums if f 
happens to be continuous. 


Upper and lower sums don’t 
depend on samples, and so 
their notations don’t involve 

ans. 
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ax Xo X,4 5 a xX X X,4 5 


(a) A smooth integrand (b) A discontinuous integrand 


Figure 5.5. Two more box sums. 


with a finer partition. Figure 5.5(b) shows another ten-element box sum, this time 
for a discontinuous integrand. 

Here is the key point: For both integrands in Figure 5.5, the total shaded 
area—the box sum—can be made as small as we wish by requiring that the par- 
tition have small norm. This seems clear enough for the smooth integrand in 
Figure 5.5(a): For a partition with tiny norm, the shaded area becomes smaller 
and smaller, approaching a skinny “tube” around the graph of f. 

Less obvious, but equally important, is the fact that the total shaded are can 
also be made small for the discontinuous integrand in Figure 5.5(b). Figure 5.6 
suggests why. Any partition ? may generate one or two fall box sum elements, 
but if has small norm, then all boxes are narrow and their areas don’t amount 
to much. We formalize these ideas in a theorem. 


11 
aX, Xz b 


Figure 5.6. Fine partitions, small box sums. 


5.3. Integrability 


Theorem 5.14 (The box-sum criterion). A function f : [a,b] — R is integrable 
on [a, b] if and only if for each € > 0 there exists some partition P of [a,b] with 
box sum less than «. 


Many uses. We outline the rather technical proof at the end of this section; it 
involves some meticulous bookkeeping. First we illustrate the theorem’s uses and 
advantages—including the fact that for a given € we need only one suitable par- 
tition to prove integrability. Observe also that the box-sum criterion works both 
ways: it detects both integrability and non-integrability. Example 1 illustrates 
both of these uses. 


EXAMPLE 1. Consider the functions 


| He< 05, _ jo ifr ¢Q 
ro={' if x > 0.5; w= 1" ifxe €Q. 


We have shown by other methods that f is integrable on [0,1] but g is not. What 
do box sums say? 


SOLUTION. For f, consider the three-piece partition 
P = {0,0.499, 0.501, 1}, 


which “isolates” the jump discontinuity at « = 0.5 in the skinny interval [0.499, 
0.501]. The box sum—only 0.002 in this case—can be made even smaller nar- 
rowing the middle interval even further. Thus f is indeed integrable. 

For g the conclusion is negative: for any partition P of [0,1], every box ele- 
ment has height one, and so the box sum is one. ©) 


The box-sum criterion is exactly what we need to prove some familiar and 
important—but otherwise elusive—results. 


Theorem 5.15. If f is continuous on {a, 6], then f is integrable, too. 


The proof idea. Continuity means that f cannot rise or fall very much over a 
small domain interval. Thus, for a partition with small norm, all box sum elements 
must be short. Since the total width of boxes is only (b — a), their total area must 
also be small. 


Proof: Let « > 0 be given. Because f is continuous on [a,b], it is also uni- 
formly continuous there, according to Theorem 3.22, page 166. This means we 
can choose 6 > 0 such that 


f(s) — f(t)| < — whenever |s—t| < d and s,t € [a, 0]. 


With integral 0.5. 


Draw your own picture. 
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A regular partition will do. 


We don’t even have candidates 
for the values of the integrals. 
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Now let P be any partition with ||P|| < 6. Since f is continuous on each subin- 
terval [2;_1, 2;], it attains maximum and minimum values there, so there exist s; 
and ¢,; in [vi-1, xi] with 


f(si) = Mi =sup{ f(x) | x € [xi-1, xi] }; 
f(ti) = mi = inf{ f(x) | x € [xi-1, xi] }. 


Our uniform continuity condition gives M; — m; < €/(b — a) for all 7, and so 


box sum = S- (M; —m;) Aa; 
i=1 


<i pt An= — See 
i=1 i=1 


as desired. 


More on integrability. We can use the box-sum criterion to prove other familiar, 
reasonable-seeming properties of the integral. Proposition 5.16 gives two sam- 
ples. 


Proposition 5.16. Suppose that f : [a,b] — R is integrable. 


(i) I[fa<c<b}, then Je f and en exist, and 


[sefrefs 


(ii) The function | f| is integrable, and 


[s < fin. 


Proof (sketch): The main proof challenge for both (i) and (ii) turns out to be inte- 
grability: We need to show that if f is integrable on [a, b], then it is also integrable 
on the smaller intervals [a, c] and on [c, ], and that | {| is also integrable on [a, ]. 
Once all the integrals in question are known to exist, the rest is easy. The equa- 
tion in (i) is essentially Theorem 5.8, page 232. See Problem 7, page 237, for the 
inequality in (ii). 

Proving our integrability claims directly from the definition of integrability 
would be difficult. With the box-sum criterion, the proof is routine. 


Let « > 0 be given. By Theorem 5.14, applied to the integral [ be f, there exists 
a partition P of [a, b] with box sum less than e. 
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(a) Bigger box sum (b) Smaller box sum 


Figure 5.7. Comparing box sums for f and for | f|. 


Let’s show first that f and i f exist. We may as well assume that c € P; 


if not, we can add c to P without increasing the box sum. Thus, ? has the form Sketch the situation to convince 
yourself. 


PH {O= Gy 81, <5 he = 6 By +s Un = bf, 
and therefore 
Pi = {a= 20,21,.-..,%m =c} and Po = {C=%m,U%m41,---,Ln = O} 


are, respectively, partitions of [a,c] and [b, c]. Because P, and P2 are subsets of 
P, and all box summands are nonnegative, the box sums for P; and P2 are clearly 
smaller than that for P. By Theorem 5.14, f is indeed integrable on each smaller 
interval. 

To show that || is also integrable on [a,b], we compare box sums for f and 
| f|. Figure 5.7 illustrates the nice answer: For any partition P of [a, }], 


box sum for | f] < box sum for f. 


In Figure 5.7(a), box-sum elements that “straddle” the x-axis become smaller for 
| f|, as shown in Figure 5.7(b). Thus any partition for f with box sum less than € 
works for | f|, too. 


Proposition 5.17. If f : [a,b] — R is monotone, then f is integrable. 


The proof idea. Figure 5.8 illustrates the proof idea for an increasing integrand, The pictured integrand has one 

with m and M as lower and upper bounds. discontinuity, but a monotone 
If P is a regular partition with norm 6, then the box sum elements can be ee ney 

“stacked” vertically as shown, to give total area less than (14 —m)0. By choosing 

6 small, we can keep the box sum as small as we like. 
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mr 


a * % Xn1 D 


Figure 5.8. Stack the boxes: why every monotone function is integrable. 


EXAMPLE 2. What does Proposition 5.17 say about integrability of some fa- 
vorite functions? 


SOLUTION. Proposition 5.17 guarantees that integrals like 
100 ie 100 100 100 1 
x? dx, Ja dx, | In(a + 1) dz, | ——~ dx 
i 0 0 o %In(e+1) 


all exist, because all integrands are monotone on (0, 100]. With help from Theo- 
rem 5.8, page 232, we can conclude that all of 


100 100 100 
| cos x dx, | In(cos x + 2) dx, | eS” dx 
0 0 0 


also exist. Even though the integrands are not monotone on [0, 100], in each case 
we can break [0, 100] up into finitely many smaller subintervals, on which each 
of which the given integrand is monotone. ©) 


Proving the Box-Sum Criterion 
Theorem 5.14 has both an “if” and an “only if” part. We start with the latter— 


easier—assertion. 


Why the box-sum criterion is necessary. Suppose f is integrable on [a, b]; let 
I= he For any given € > 0, we’ll find a partition P of [a, b] with box sum less 
than €. Since f is integrable, we can choose 6 > 0 such that 
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€ € 
fas Lope 
5; < RS (f,P,S) < ar 


for every Riemann sum RS (f,P,S) based on a partition P with ||P|| < 6 and 
samples S drawn from P. Now let P be any such partition; this P will do what 
we want. 

To see why, note first that the upper sum US (f,P), although perhaps not a 
Riemann sum itself, is the supremum of all possible Riemann sums based on P. 
In particular, we can choose a set of samples S from ? so that 


RS (f,P,S) > US(f,P) — 5, 
and so we have 


€ € E€ € 
Sf ope Spa 
US(f,P) <RS(f,P,S)+5<I+ p+ g=It+5 


A similar argument shows that 


LS (f, P) STs 
and so 
box sum = US(f,P) — LS(f,P) <«, 


as we wanted to show. 


Why the box-sum criterion is sufficient. The plan of the proof is to use the box- 
sum condition to obtain the “Cauchy criterion” of Lemma 5.10, page 238, which 
in turn implies integrability. We show, in fact, that if for given e > 0 we can find 
some partition Po with small box sum, then there exists some (very small!) 6 > 0 
such that every partition P with ||P|| < 6 must also have small box sum, and this 
does the trick. 

One picky technical lemma will prove useful. We leave its straightforward 
proof as an exercise. 


Lemma 5.18. Let f : [a,b] — R be a bounded function, with m < f(x) < M 
fora € [a,b]. Let P be a partition of |a, b] with ||P|| < 6, and let US = US (f,P) 
and LS = LS (f, P) be the corresponding upper and lower sums. 

Adding one more point to P produces new upper and lower sums, USnew and 
LSnew, such that 


(i) USnew < US and LSypew > LS; 
(ii) US—USnew <(M—m)d and LSnew — LS < (M — m)o. 


In words: Adding one point to P decreases upper sums and increases lower 
sums—but by no more than (M — m)o. 


A regular partition will do. 


Draw a simple picture to get 
started. 
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Unlikely as that may seem in 
advance. 
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Now we can prove that the box-sum condition implies integrability. For given 
€ > 0, we first choose any particular partition 


Po = {wo,wi,-.-,wn}, 


of [a, 6] with N partition points and box sum less than €/2; that is, 
US (f,Po) — LS(f,Po) < 5. 


Next we set Z 


4(N —1)(M —m)’ 
where M and m are, respectively, upper and lower bounds for f on [a, b]. This 6 
will turn out to work in Lemma 5.10. 

Now let P be any partition of [a, b] with ||P || < 6. Consider the new partition 
P' formed formed from P by adding in the N — 1 partition points w 1, we, ..., 
wyn-1. (Since wo = a and wy = 5, adding them to P has no effect.) 

Now Lemma 5.18(1) implies that 


US (f, P’) < US(f,Po), 


because P’ “refines” Po. Because P’ has at most N — 1 more points than P, 
Lemma 5.18(ii) says that 


US (f,P’) > US(f,P) —(N -1)-(M—m)-6 


€ 


6= 


Putting these inequalities together gives 
US(f,P) < US(f,P!) + = < US(f, Po) + =. 
A similar argument shows that 
LS (f,P) < LS (f,Po) + 7 


All these inequalities boil down saying that, for any partition P with ||P|| < 6, 
both the upper and lower sums LS (f,P) and US (f, P) lie in the interval 


(1S(f, Po) — $, US (f,Po) +5), 


which has length less than ¢. In particular, every Riemann sum RS(f,P,S) is 
between LS (f,P) and US (f, P), and therefore lies in the same interval. 

Finally, we observe that if both P; and P» are partitions of [a,b] with both 
|Pil| < 6 and ||P2|| < 6, and if samples S, and Sy come from P; and P2, 
respectively, then both RS(f, Pi, S$) and RS(f, P2, S2) lie in the same interval 
of length less than ¢, and are thus within € of each other. Thus the hypothesis of 
Lemma 5.10 is satisfied, and the integral E f exists. 
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Exercises 


1. 


. Show using box sums and Theorem 5.14 that the function f(x) = x 


. Show using box sums and Theorem 5.14 that the function f(x) = x 


Suppose f is continuous on R. Explain why the functions defined by f?(), 
sin(f()), f(sin x), and In (sin f(x) + 2) are all integrable on every inter- 
val [a, 0]. 


. Consider the integral ee sina dx. Find a partition P of [0, 10] that gives 


box sum less than 0.01. (Hint: Use the fact that inequality |sin z — sin y| < 
|x — y| holds for all x, y € R.) 


. Consider the function f defined by f(n) = Oifn € Zand f(x) = 7 


otherwise, and let « > 0. Find a partition P of [0, 10] that gives box sum 
less than €. (Hint: Isolate the “offender points” in small subintervals.) What 
- p10 
is f? 

0 


. Use box sums to show carefully that any function of your choice is not 


integrable on [0, 1]. 
2 
integrable on [0, 1]. 
3 


integrable on [0, 1]. 


. Suppose that f is integrable on [0,2]. Show using box sums and Theo- 


rem 5.14 that f is integrable on [0, 1], too. 


. We said in Example 2, page 246, that ie cos x dx can be shown to exist 


using Proposition 5.17, page 245 and Theorem 5.8, page 232. Give the 
details. 


. State and prove the converse of Lemma 5.10. 


. Show that the sequence {J,,} defined in the proof of Lemma 5.10 is Cauchy. 


. Show that if f : [a,b] — R is a bounded function, P is a partition of [a, b], 


and S is any sampling set from ?P, then 
LS(f,P) < RS(f,P,S) < US(f,P). 


Note: This is (i) of Proposition 5.12; it says that for any partition P, the cor- 
responding upper and lower sums are upper and lower bounds, respectively, 
for the set of all Riemann sums associated with P. 


. Give a “picture proof” of Lemma 5.18, page 247. (Hint: Adding one more 


point to a partition changes only one box sum element. Convince yourself 
that this change is no more than the claimed amount.) 
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14. 
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Show that if a f = I exists, and P is any partition of [a, b], then 
LSC PST Ss US(EP)« 


Note: This is (ii) of Proposition 5.12; it says that, for any partition P, the 
corresponding upper and lower sums “trap” the exact value of J from above 
and below. 


(Hint: To show that US(f,P) overestimates I, consider the “stair-step” 
function fpig defined by 


M, ifa<xa<2 
My ifa,<x2< 29 
foig(a) = 


M, iftz,,<a<b 


Why is fpig integrable? What is its integral? How does rs foig Compare to 
b 

lage 

Consider the function f given by f(1/n) = 1ifn € N and f(x) = 0 

otherwise. Show, using box sums and Theorem 5.14, that J = iis f exists. 


(We know already—see Problem 12, page 238—that if the integral exists, 
then J = 0.) 


The point is to find, for any given € > 0, any partition P of [0, 1] with box 
sum less than «. To do so, start at the left, with x9 = 0 and az, = €/2; 
this box element contributes «/2. Since only finitely many of the 1/n lie 
to the right of €/2, one can choose the remaining partition points so that 
each remaining “bad point” lies in an interval small enough so that the total 
length of these intervals is less than ¢/2. Complete this sketch to give a 
careful proof. 


5.4 Some Fundamental Theorems 


We met a useful but preliminary version of the fundamental theorem of calculus 
(Theorem 5.9, page 233) in Section 5.2. Using Theorem 5.15, page 243, we can 
now tie up a serious loose end—existence of the integral—in connecting integrals 
and derivatives. Here is a new, improved version: 


Theorem 5.19 (Fundamental theorem of calculus, version 1). Let f : [a,b] > 
R be continuous, and suppose F"(x) = f(x) for all x € [a,b]. Then f is inte- 
grable on {a, b|, and 


b 
/ f = F(b)— F(a). 


5.4. Some Fundamental Theorems 


Proof: We’ve already done all the hard work. By Theorem 5.15, the integral 


f 7 f exists, and Theorem 5.9 guarantees that the integral’s value is indeed F'(b) — 
F(a). 


Elementary calculus revisited. Theorem 5.19 makes precise a sense in which 
integration and differentiation are inverse processes. Now that we know it, we 
can assert standard calculations of elementary calculus, like this one— 


1 3 1 1 
[ #u= 5 =--0, 
0 3} 3 


with perfect confidence. 
EXAMPLE 1. Find [> sin x dz exactly. 


SOLUTION. Nothing could be easier. If f(a) = sin x, then F(x) = — cos() is 
an antiderivative, and so 


sin x dx = — cos(7) + cos(0) = 2. 
0 


Having worked very hard to build the machine, we now only need to turn the 
crank. ©) 


EXAMPLE 2. Find f> sin(«?) dz. 


SOLUTION. The problem superficially resembles the one in Example 1, and it 
would be nice to proceed in the same way. But there is a problem: f(x) = sin(x?) 
has no nice antiderivative formula F(x) into which we can plug endpoints. We 
can always estimate the answer numerically, but that is less satisfying. ©) It's about 0.773. 


Antiderivatives do exist. The difficulty in Example 2 notwithstanding, the inte- 
grand f(a) = sin(x?) there does have an antiderivative—it just doesn’t happen 
to be expressible in simple symbolic form. Indeed, every continuous function has 
antiderivatives, and another version of the fundamental theorem of calculus says 
why. 


Theorem 5.20 (Fundamental theorem of calculus, version 2). Let f : [a,b] > 
R be continuous. Define a new function F by 


F(a) = | f forall € |a, db]. 


Then F'(c) = f(c) forall c € (a,b). 
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Proof: Note first that F(a) is defined for all x € [a,b]. Part (i) of Proposi- 
tion 5.16, page 244. says so, as does the fact that f is continuous on all subinter- 
vals [a, ]. 

Fix c € (a,b); we’ll show F"(c) = f(c). By definition, 


F'(c) = lim — 


if the limit exists. We'll show that 


F(e+h)— Fle) _ 


h—0+ 


the proof for the left-hand limit (where h — 07) is similar. 
For any h > 0, we have, from basic properties of the integral, 


Fe+h)- Fo) _ fe f- Set ies 
h = h ae 


Since f is continuous on [c,c + A], it assumes maximum and minimum values, 
say M;, and mp», on this interval. Because mp, < f(a) < My for x € [c,c+ Al, 
it follows that 


cth cth eth 
my-h= | made < [ fe) dr < f M;, dx = M,-h, 


and therefore 


This means, in turn, that the middle quantity is “intermediate” between m,, and 
My, for the continuous function f. Thus, by the intermediate value theorem, there 
exists some input x), with c < x, < c+ hand 


(oe f 


er ACLoL 


As h — 0+, we have x, — c, and since f is continuous at c, we have 


as desired. 


5.4. Some Fundamental Theorems 


Nice to know, but .... Theorem 5.20 assures us that every continuous function 
f, no matter how ill-behaved, has an antiderivative F’. (Being an antiderivative, F 
is automatically differentiable and hence also continuous.) This is nice to know in 
the abstract—but not helpful for calculating integrals like the one in Example 2. 
Indeed, many useful and harmless-looking calculus functions, including 

sin x 


cos(z*), exp(x), and ; 
x 


turn out not to have elementary antiderivatives, and are said not to be “integrable 
in closed form.” 


Average values, and another mean value theorem. Integration has a natural 
connection to averaging: 


Definition 5.21. If f is integrable on [a, b], then the average value of f on [a, 6] is 
given by 


ne 
b-a 


average value = 


Embedded in the proof of Theorem 5.20 is another theorem (and its proof) of 
independent interest: 


Theorem 5.22 (Mean value theorem for integrals). [f f is continuous on {a, b], 
then there exists c in (a, b) for which 


b 
[ f= 1-6-4) 
a 
In words: a continuous function on |a, b| assumes its average value. 


We explore some implications in the exercises. 


Exercises 


1. Show that if ft f = and f is continuous on [a, }], then f(c) = 0 for some 
c € (a,b). Give an example to show that the conclusion need not hold if f 
is not continuous. 


2. Let f and g be continuous on [a, b] and suppose Hy f= ‘is g. Show that 
f(c) = g(c) for some c in [a, }]. 


3. Suppose f has average value 3 on [a, b]. What is the average value of 5f +7 
on [a, 6]? Why? 
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Some are very ill-behaved. 


Elementary functions are nice 
combinations of the familiar 
calculus-style functions. 
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. In each part, find J = i f, the average value of f on [0,2], and a value 


of c at which the average value is achieved (in the sense of Theorem 5.22). 


(a) f(x) = Vz 
(b) f(z) = 2. 

(c) f(a) = 27, 
(d) f(z) =a” 


(a) f(x)=Vz 
(b) f(x) = 2. 

(c) f(a) = 27, 
(d) f(z) =a” 


. Let h : [a,b] — R be continuous on [a,b]. Suppose that h(x) > 0 for all 


x € [a,b] and that fe h = 0. Use Theorem 5.20 to show that h(x) = 0 for 
all x € [a,b]. (Hint: Use properties of the function H(x) = J” h to derive 
the result.) 


. Let f be continuous on [a, b]. Suppose that the average value and the maxi- 


mum value of f on [a, b] are equal. Show that f is constant. Must the same 
result hold if f is not continuous? Why? 


. Let f : [(0,0c0) — R be continuous everywhere, and suppose that f has 


average value 3 on every interval |0, b]. Find a formula for /. 


. Let f : [0,co) — R be continuous everywhere, and suppose that f has 


average value b/2 on every interval [0, b]. Find a formula for f. 


Let f : R — R bea linear function. Show that the average value of f on 
any interval [a,b] is f(m), where m is the midpoint of [a, 6]. 


2 


. Find an interval [a, 6] such that the average value of f(a) = 2° occurs at 


the midpoint. 


. We used the mean value theorem for integrals to prove Theorem 5.20. Use 


Theorem 5.20 and the mean value theorem for derivatives to prove the mean 
value theorem for integrals. (Assume that f is continuous, of course.) 


. (This problem refers to material in Section 4.4.) Consider the functions 


fn(x) = x” forn = 1,2,3,... and the limit function f given by f(x) = 0 
if x € [0,1) and f(1) = 1. 


5.4. Some Fundamental Theorems 


(a) We showed in Section 4.4 that the sequence { f,, } converges pointwise 
to f on [0,1]. Does fo fn converge to if f? Explain. 
(b) The sequence { f,,} converges uniformly to f on [0, 0.9]. Does Ng fin 
0.9 . 
converge to if f? Explain. 


14. (This problem refers to material in Section 4.4.) Consider the functions hy, 
defined by h(x) = nif x € (0,1/n) and h,(x) = 0 otherwise. Let h be 
the constant function h(x) = 0. 


(a) Show that the sequence {h,, } converges pointwise to h on [0, 1]. Does 
de hy converge to ite h? Explain. 

(b) Show that the sequence {h,,} converges uniformly to h on [0.1, 1]. 
Does ee hy converge to ier h? Explain. 


15. (This problem refers to material in Section 4.4.) Let {f,,} be a sequence of 
continuous functions on [0,1], and suppose { f,,} converges uniformly on 
(0, 1] to a function f. 
(a) Explain why f must be integrable on (0, 1]. 
(b) Prove that i fn converges to fo f: 
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1.1 


Solutions 


Numbers 101: The Very Basics 


(a) The claim makes sense and is true. 

(b) The claim makes no sense; v/8 isn’t a subset. 

(c) The claim makes sense and is true. 

(d) The claim makes sense but is false; consider a = 0 and b = J2. 

(e) The claim makes sense and is true. 

(f) The claim makes sense but is false: consider a = 0. 
The number 1/a is an integer only if a = +1. The number 1/a is rational for all 
nonzero integers a. The equation 1/a = a holds only if a = +1. 
Theorem 1.1 is useful. Because Q is closed under addition, multiplication, and divi- 


sion (if denominators aren’t zero), expressions like those in (a) and (b) are rational— 
unless, for (b), p = gq = O. Expressions involving square roots are different. If 
p = q = 1, for instance, then ,/p? + g2 = V2 is irrational; the same expression 


is rational if p = 3 and q = 4. The quantity \/p? + 2pq + q? is always rational, 


since \/p? + 2pq + q? = \/(p + g)? = +(p+q). (Note that /(p + ¢)? =p+q 


may be false.) 


(a) The answer is no. For instance, the set {1,1/2,1/3,...} is a subset of Q, 
but has no least element. 


(b) The set R = { 1, 10, 100, 1000, ... } does have the well-ordering property; 
every nonempty subset includes a smallest power of 10. 


(c) The set T = { —3, —2, —1, ..., 41, 42} (like all finite sets of real numbers) 
does have the well-ordering property, since every nonempty subset of T’ is 
also finite, and hence has a least element. 


(d) If we trade “least” for “greatest” in the well-ordering property, the result no 
longer holds for N, since N itself has no greatest element. The property does 
hold for the finite set T, and also for Z \ N = {..., —3, —2, —1, O}. 


Since In In In n tends to infinity, it must exceed two for large n. The well-ordering 
property guarantees that a smallest such no exists. (Using a calculator we can see 
that no has about 703 decimal digits.) 


Assume toward contradiction that /3 = a /b for integers a and 6, and the fraction 
is written in reduced form. Then squaring both sides gives 3b? = a”. This implies 
(essentially as in the proof of Theorem 1.2) that 3 divides both a and b, which 
contradicts the assumption that a/b is in reduced form. 
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10. 


11. 


13. 


1.2 


Solutions 


(a) We argue by contradiction: If x is rational, then (by Theorem 1.4) x” is ratio- 
nal, too, which contradicts our assumption. 


(b) Another proof by contradiction. If « = /2 + V3 is rational, then 7? = 
5 + 2V6 is rational, too. This implies, in turn, that /6 is rational, which is 
absurd. 


(c) Yet another proof by contradiction. Let’s write 7 = J2 + V3 + V5, then we 
have « — /5 = V2 + V3, and suppose z is rational. Squaring both sides of 
the last equation and simplifying gives 


x’ + 10avV5 = 2V6, 


which is progress, since only two square roots remain. Squaring again gives 
x! + 202° V5 + 5002” = 24, 


which is even better, as only one square root is left. The last equation implies 
that 


= 2 24—500x* — x* 


Because x is rational, so is the right-hand side above. This absurdity com- 
pletes the proof. 


Parts (i) and (ii) follow from the fact that 1 < a/b < 2. For part (iii), note that 


a” (2b—a)? 4b? — dab + a? 


b2 (a—by? a2? — 2ab+ Bb?’ 


and substituting a? = 2b? shows that the last fraction is 2. 


This all shows that if \/2 = a/b holds for any positive integers a and b, then we can 
find a new fraction a’ /b’ with /2 = a’ /b’ and b’ < b, which is absurd. 


If a and b are rational, then 
1  a-by2 
a+b/2 a? + 2b?" 


This shows that elements of F' have multiplicative inverses in F’. The rest is easier. 


Sets 101: Getting Started 


(a) DCI; DEC. 

(b) B= {me A| mhas 31 days}. 

(c) Ax Dis the set of ordered pairs (January, 2), (February, 2), ..., (December, 2), 
(January, 3), (February, 3), ..., (December, 3). There are 24 such pairs. 

(d) A\B = {February, April, June, September, November}; B\.A = 0; ANC = 
{November}; BN A= B; DN I=D;DUI=I. 


Solutions 


3. (a) Claim (i) is false. As one example, take A = R and B = @. Then 
R\ (AUB) =Obut (R\ A)U(R\ B)=R. 


(b) If A = B, then both claims are true. 


7. (a) If S = {1,2,3} then P(S) = {0, {1}, {2}, {1,2}, {3}, {1,3}, 442, 3}, 
{1, 2,3} }, a set with eight members. 


(b) Every subset of S is clearly a subset of T’. 
(c) Each subset A of Nio corresponds to two subsets of N11: A and AU {11}. 
9. S" has n elements. Each element of S’ is formed by omitting one element of S. 

11. (a) Three vertical lines. 

(b) Three horizontal lines. 

(c) The set of integer “lattice points” above the x-axis Z x N. 

(d) The parabola y = x”. 

(e) A vertical sine curve. 

(f) The empty set. 
12. (a) The picture is a diagonal stripe from upper left to lower right. 

(b) The black squares can be described by the set {(x, y) | x + y is even}. 


(c) The element (2, 3, black) corresponds to a black square at position (2, 3). The 
set G x {black, white} represents all possible ways to choose a square and 
color it. 


(d) A picture is, in effect, a subset of squares to be colored black. Thus P(G) 
corresponds to the full set of possible pictures. 


1.3. Sets 102: The Idea of a Function 


1. There are many possibilities; following are some. 


(a) Let A be the set of all humans who have ever lived, and B the set of all women 
who have ever lived. The function is not injective, because siblings have the 
same mother. The function is not surjective, either, since some women are 
not mothers. 


(b) Let A be the set of all mothers of sons, and B the set of all male humans. 
Then FIRSTBORNSON: A — B is one-to-one but not onto. 


(c) Let A be the set of all humans and B the set of all colors. Then EYECOLOR : 
A — Bis neither one-to-one nor onto, since several people have blue eyes, 
and nobody has silver eyes. 


(d) Let A be the set of all US citizens and B = {January 1, January 2, ..., 
December 31}. Then BIRTHDAY : A — B is onto (every day is someone’s 
birthday) but not one-to-one (several people have the same birthday). 
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10. 


1.4 


1. 


Solutions 


(a) The domain is the set of states that begin with M. The range is the finite set 


{3, 4,6, 10, 11, 12, 17}. The codomain can be any set that contains the range, 
such as N or even R. 


(b) f is not injective; f(Maryland) = f(Minnesota). 
(c) f is surjective only if the range and codomain are the same set. 
(d) A better name than f for this function might be ELECTORALVOTES. 


(a) L is one-to-one since, if L(a1) = ax, + b = ax2 + b = L(x), then basic 


algebra gives 


avi +b=are+b ax, = axe 1 = 2, 


(note that a # 0 is essential!) as the definition of injectivity requires. 


L is onto because if yo is any real number, then we can set xo = (yo — b)/a, 


and we find ;: 
f(vo) =aro + b= a — + b= yo. 


(again, a # 0 matters) as the definition of surjectivity requires. 


(b) If we define M(x) = (x — b)/a, then we have, for any x € R, 


as desired. A similar calculation shows that M(L(ax)) = 2 forall x € R. 
Thus, M@ = L~1,as hoped for. 


(a) Claim: If both f and g are one-to-one, then so is go f. 


Proof: Suppose a; € A and az € A, and that g(f(a1)) = g(f(a2)). We’re 
done if we show that a; = ao. 


Because g is one-to-one, we know 


9(F(a1)) = 9(fla2)) => fla) = fla); 


and since f is one-to-one, we have 


f(a1) = f(az) a1 = a2, 


as desired. 


Proofs 101: Proofs and Proof-Writing 


(a) Given: Raining implies not sunny (true). Converse: Not sunny implies raining 


(false). Contrapositive: Sunny implies not raining (true). 


(b) Given: Raining implies clouds (true). Converse: Cloudy implies raining 


(false). Contrapositive: Not cloudy implies not raining (true). 


Solutions 


(c) 


(d) 


3. (a) 


(b) 


(c) 


(d) 


Given: Raining implies not sunny and cloudy (true). Converse: Not sunny and 
cloudy implies raining (false). Contrapositive: Sunny or not cloudy implies 
not raining (true). 


Given: Cloudy implies not sunny (true). Converse: Not sunny implies cloudy 
(false ...it could be night). Contrapositive: Sunny implies not cloudy (true). 


Converse: If a + 6 is rational, then a and b are both rational. (False.) Con- 
trapositive: If a + bis irrational, then at least one of a and 6 is irrational, too. 
(True.) 


Converse: If 1/a is irrational, then a is irrational, too. (True.) Contrapositive: 
If 1/a is rational, then a is rational. (True.) 


Converse: If ab is irrational, then both factors a and 6 are rational. (False.) 
Contrapositive: If ab is rational, then at least one of a and bis rational. (False.) 


Converse: If limn—o an = 0, then > Gn converges. (False; look at the 
series 14+ 1/2+1/3+....) Contrapositive: If limn—oo an 4 0, then S* an 
diverges. (True; this is known as the nth term test.) 


Negation: All of a, b, and c are negative. 
Negation: f(x) > 3 for some x € [2, 7]. 
Negation: sin n is rational for some positive integer n. 


Negation: Vz € R we have x? # —1. 


Negation: dz € R such that Vy € R we have xy # 1. 


All dogs have fleas. Negation: Some dog has no fleas. Statement P is (prob- 
ably) false. 


Several dogs have fleas. Negation: No dog has fleas. Statement @ is true. 


My dog, Spike, is flea-free. Negation: Spike has fleas. The truth of statement 
R depends on Spike. 


3 < 5. Negation: 3 > 5. Statement S is true. 


V¥n—1+ Vn-+1 is irrational for every positive integer n. Negation: 
Vvn—1-+ n+ 1 is rational for some positive integer n. Statement T’ is 
true. 


n—1+ VWn+1 is rational for every positive integer n. Negation: 
n—1+ n+ 1 is irrational for some positive integer n. Statement U 


cos(x) < 1.001 for all 2 > 0. Negation: cos(a%) > 1.001 for some x > 0. 
Statement V is false. 


cos(x) < 0 for all x € [2,3]. Negation: cos(a) > 0 for some x € [2,3]. 
Statement W is true. 


Every even integer n > 2 is the sum of two (not necessarily distinct) primes. 
Negation: Some even integer n > 2 is not the sum of two primes. Statement 
X is Goldbach’s conjecture—an important unsolved mathematical problem. 
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15 


Solutions 


Every set of ten distinct numbers has a largest and a smallest element. Nega- 
tion: Some set of ten distinct numbers has either no largest or no smallest 
element. Statement Y is true. 

Every subset of the interval [0, 1] contains a largest element. Negation: Some 
subset of [0, 1] contains no largest element. Statement Z is false: (0,1), for 
example, has no largest element. 

If a function f is increasing for all x, then f’(x) > 0 for all x. 

If afunction f has a maximum at x = a, then either f’(a) = 0 or f’(a) does 
not exist. 

If f(x) > 0 for all x, then the graph of f is concave up. 


If a series )~ an converges, then lim an = 0. (This is known as the nth term 
test.) 


Types of Proof 


(a) 
(b) 
(c) 
(d) 
(e) 
(a) 
(b) 


(c) 


True; the simplest proof is indirect. 

False; « = —\/2 is a counterexample. 

True; the simplest proof is indirect. 

False; p = 11 is a counterexample. 

False; « = 0 and y = 1/2 is a counterexample. 

If n = 1, then both sides have value 1; if n = 10, then both sides are 100. 
The base case, P(1), holds as indicated in (a). 

For the inductive step we assume P(n) and try to prove P(n + 1), which is 


the claim that 1+ 2+---+(n+1) = (n+1)(n+4+2)/2. We start by adding 
(n + 1) to both sides of P(n), and then do algebra: 

1 
142434--tnt(nt1)=2@4) 4 gay 


=(n+1)($ +1) _ (n+ I(n+2) 
2 2 
This shows P(n + 1), and completes the proof. 


Calculate with sums: 


n 


= 7 n(n + 1) _ 2 
S7(Qn-1)=25 >n-So1 2. 


n 
k=1 k=1 k=1 


5. Note first that P(n) is obviously true for n = 4; just check both sides. 


To complete the inductive proof, we assume P(n) (for n > 4) and show P(n + 1). 
By P(n), we have 2” < n!. We know also that 2 < (n +1). Multiplying these 
inequalities gives 


Oem > 2 < alin +1) = (n+ 1) 


This shows that P(n + 1) holds, and we’re done. 
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7. The base case P(1) is easy, so we assume P(k) and show P(k + 1). 


In P(k + 1), the left side is (1+ 2+3+---+k+(k+1))?; we'll show that 
it equals 1° + 2? + 33 +.--- + (n+ 1)3. Along the way we’ll use the result of 
Example 8. 


Here goes (some details left to you): 


(1+ 2+---+kh+(K+D)? =((1+2434+---+4)4+(K4+0)) 
=(142434---+k)? +21 42434+---+4)(K+D + (R41)? 
k(k +1) 
2 
= 194298 4-.-462 +(b +1), 


= Pap oP be eR ao (Kab) (R41)? 


as desired. 
9. (a) Wecan simply check by direct calculation (technology helps) that 5" > n! 
forn = 1,2,3,...,11. 
(b) The proof is by induction. 


The base case n = 12 holds by a direct calculation. For the inductive step, 
suppose 5° < k! for a positive integer k > 12. Then clearly 5 < & +1, and 
so5-5* < (k+1)-k!; equivalently, 5*t* < (k + 1)!, which completes the 
inductive step. Done. 


11. The base case is our assumption (i). The inductive step follows easily from the 
product rule. 


13. This is a standard old brain-teaser. The inductive step argument works fine for 
n = A2, but it fails when n = 1. 


1.6 Sets 103: Finite and Infinite Sets; Cardinality 


1. (a) To show f(x) = 7x — 2 is one-to-one, we need only show that if f(71) = 
7x1 —2 = 7x2 —2 = f (x2), then x1 = x2. This is easily done by manipu- 
lating the equation 771 — 2 = 7x2 — 2. 


Showing that g(a) = x/(1+ «) is one-to-one is similar: 


= => U1 +%1%2 = %2+%2K1 => %1= 2. 


(b) To see f is onto, let b € (—2,5). If we set a = (y + 2)/7, then a € (0,1) 
and f(a) = b. 


(c) f og: (0,00) — (—2,5) is one-to-one and onto because this is true of both 
f and g. This means that (0, oo) and (—2, 5) have the same cardinality. 


3. f can be one-to-one but g cannot. The latter holds by the pigeonhole principle, 
since g tries, in effect, to put 43 pigeons into 42 holes. 
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de 


9. 


11. 
13. 


14. 


1.7 


Solutions 


(a) The function f : N > N given by f(n) = n + 1 works. 
(b) The function g : R > R given by g(x) = arctan x works. 
(c) Defineh: R— Rby h(x) =avife €Nandh(x)=2+1ifxeEN. 


Proof by contradiction. If the hypotheses hold and A \ B is finite, then A = 
(A \ B) U B is the union of two finite sets, and hence finite itself. This is absurd. 


(a) Members of Q3 correspond to (some of the) points (p, q) with integer coordi- 
nates inside the circle p? + q? = 3—there are nine such points (p,q). Since 
we’re interested in fractions p/q, we may as well require q > 0. This reduces 
the eligible points to just three: (—1, 1), (0,1), and (1, 1), all of which give 
different reduced fractions p/q. 

Note that Qa, is the same set as Q3. 


For Q1o, similar reasoning finds 11 points (p,q) with q > 0 and p?+q? < 10. 
These 11 points correspond to 7 different rational numbers: +2, +1, +0.5, 
and 0. 


(b) If p/q € Qi, then clearly both |p| << and |q| < 4. There are at most 2i — 1 
choices for each of p and q, and so no more that (2i — 1)? choices for both 
p and q. (In fact, the number of choices is less, but that’s OK for present 
purposes.) An even rougher (but simpler-looking) upper bound is that Q; has 
fewer than 42” elements. 


(c) All parts are straightforward implications of the definitions. 


Here is one possible bijection f : [0,0o) — (0,00): Define f(x) =x+1lifeEeZ 
and f(x) = x otherwise. 


The set B is countably infinite. The set B,, of n-character books is clearly finite for 
each n, and B is the union of the (countably many) B,. 


(a) A polynomial of degree n in Z[z] is essentially an (n + 1)-tuple of integers 
(don’t forget the constant term), i.e., an element of the (n + 1)-fold product 
ZxZ x ...Z. This product is countable, by Proposition 1.18, page 50. The 
set Z[x] consists of polynomials of all possible degrees, and is therefore a 
countable union of countable sets. 


Numbers 102: Absolute Values 


(a) x € (— 

(b) xe (- U 
(c) ae (- aay 
(d) x € (1,00). 


(e) xe (- 2.07, — V3.93) U (V3.93, VZ07). 


9,2). 
O09) 


Solutions 


3: 


(a) Itis enough to prove the RTI under the assumption that |x| > |y|. (If |z| < |y| 
we can just reverse the roles of x and y.) In other words, we need to show 
that |2 — y| > |x| — |y|. But this is equivalent to |a — y| + |y| > |z|, which 
is just a form of the ordinary triangle inequality. 


(b) The distance from x to y is at least as great as the distance from |z| to |y|. 
In other words, taking absolute values of numbers either shrinks distances or 
leaves them alone. 


(c) The RTI is an equation except when x and y have opposite signs. 


This view of things helps explain the triangle inequality name: the length of one 
side of a triangle is not greater than the sum of the other two sides. 


Use the triangle inequality: |x —y| = | -7+7-—y| < |jc—7|+|7-y| = 
jc—7|+|y-—7] <e+e=2e. 


(a) |x —1| < 0.5 implies 1 — x < 0.5, orz > 0.5. 
(b) Use the triangle inequality: |c| = |c—x+2| < |ce—2|+|2| < tcl + |a|. 
This implies |c| < lel + |x|, or |x] > lel 
An integral So f(x) dx is a limit of approximating sums, like f(a1)Aavi+ 
f(xv2)Aait+---+f(an)Arn. Thus lJ f(z) da| is approximated by | f(v1)Avi+ 


f(a2)Aait+ ... +f(an)Azy|, and the integral iC |f(a)| dx by |f(v1)Aai| + 
x2)Aa1| +--+ + |f(an)Avn|. The usual triangle inequality applies to these 
A A Th 1 triangle inequality appli h 
sums. 


Bounds 


(a) The shortest English words have one letter, so 1 is a lower bound for f on EW. 
It’s hard to identify the longest English word, but 1000 letters is probably a 
safe upper bound. 


(b) Words in this problem vary in length from 1 to 10 letters. 


English sentences vary in length from one word (Stop!) to many, many words, but 
1000 words is probably a safe upper bound. On the set SITP, values of g vary from 
5 words to about 30 words, depending on how you count words in equations. 


Suppose S is bounded above by / and below by m. Then S C [m, M]. If we take 
K to be the larger of |m| and |M|, then S Cc [m, M] c [-K, K]. 


(a) The maximum value is f(0) = 1; the minimum is f(+A) = 1/(K +1). 


(b) The graph is a parabola with vertex at x = —1/3, so the maximum occurs 
at the right endpoint; g(K) = 3K? +2K — 7. If K > 1/3, then —2/3 € 
[—K, K], and so the minimum value is g(—1/3) = —22/3. If K < 1/3 then 
the minimum occurs at « = —K,, and the value is g(—K) = 3K? —2K —7. 


(c) Note that h(x) = 1+ 2, so the function is largest at the right endpoint 
(h(K) = 1+ K) and smallest at the left (h(—K) = 1— kK). 
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Solutions 


11. (a) S = (1,00) is one possibility; there are many. 


(b) Let S be bounded away from zero. Then there exists 6 > 0 so |s| > 6 for all 
s € S. Thus, if t € T we have |t| = |1/s| = 1/|s| < 1/6, so T is bounded. 
The proof of the converse is similar. 

(c) If Sis bounded away from 0 then—by definition—S has no points in common 
with the interval (—6,6), which is of the desired form. If, conversely, S has 
nothing in common with (a,b), where a < 0 < b, then we can take 6 to be 
the smaller of —a and b. 


13. (a) Ifa ¢ J, then either a < 0 ora > 1. In the former case, we can take 
6 = —a/2; in the latter, 6 = (a — 1)/2 works. 


(b) The points 2 and 3 are not in J, but J is not bounded away from either 2 or 3. 


(c) N does indeed have the property in question: if a ¢ N. then we can take 6 to 
be the distance to the nearest integer. 


15. Suppose S' were bounded away from @. Then there is 6 > 0 such that B—s > 6 
for all s € S. But this means 6 — 6 > s for all s € S, and so G — 6 is an upper 
bound for S. This means ( is not the /east upper bound, a contradiction. 


1.9 Numbers 103: Completeness 


1. (a) The set S is bounded above by 2. 


(b) Calculation shows that (3 + h)? < 2; this contradicts the fact that 3 is an 
upper bound for S. 


(c) Calculation shows that (@ — k)? > 2; this contradicts the fact that 3 is the 
least upper bound for S. 


1 1 1 1 
: ibilit is the list —~, —~, —~,..., ———__........ 
3. (a) One possibility (among many) is the lis 508? 509’ B10? SOV an’ 
i Lege Ces cen ee Rigger 1 1 
b) O ibility is the list — + =, -~+-,-+4+-,...,- gual, 
(b) One possibility is cls hes fg arg Sean 


(c) One possibility is to let h = b — a, and use the list a + h/2, a+ h/3,a+ 
h/4,...,a+h/n,.... 
4. For given € > 0, consider the positive number 1/e. Since N is unbounded, there 
exists an integer n with n > 1/e. It follows that 0 < 1/n < e, as desired. 
5. (a) The claim is true if 0 < b < a; we can just letn = 2 (or even n = 1 if 
b<a). 
(b) The Archimedean claim is clearly false if a = 0 anda < b. It is also false if, 
say, a = —landb=1. 
7. (a) The italicized statement is still true with Z in place of R. A set of integers 
that is bounded above has a /argest member, which is an integer. 


(b) The italicized statement becomes false if R is replaced by Q. A bounded set 
of rationals has a supremum £3, but 3 may be irrational. 


Solutions 


9. (a) Ifnot, then a, > by, for some m and n. If n > m, then we have an, > am > 
bn, which contradicts the definition of the interval I, = [an, bn]. If m > n, 
then we have am, > by > bm, another contradiction. 


(b) By the preceding part, b, is an upper bound for A, and so can’t be smaller 
than a, the least upper bound. 


(c) Asin the preceding part, we have an < @ for all n. The outermost inequalities 
restate the definitions. 


2.1 Sequences and Convergence 


1. For the three sequences given: 


« | 1.000 0.100 0.010 0.001 
(a2) N] 1 V10%316 V100=10 vV1000% 31.6 
(b) N| 1 100 10, 000 1, 000, 000 
(c) N 1 e? = 8103 e% e999 


2. (a) Ifa given N works for « = 0.001, then the same N works for larger «. We 
could use this N in all positions in the row. 


(b) Any N works in this case; we could use N = 0 in all positions, for example. 


(c) The number of digits in a number changes after 9, 99, 999, etc. So the second- 
row entries would be 9, 10° — 1, 10°° — 1, and 10°°° — 1. 


3. (a) Set € = 0.1 and choose a corresponding N as in the definition. All x, with 
n > N lie in the desired interval. 


(b) Set € = 0.001 and choose a corresponding WN as in the definition. Then all 
Ln with n > N lie in (4.999, 5.001), and therefore exceed 4.999. 


(c) If we set € = 1 and choose a corresponding JN, then all x, with n > N lie 
in (4,6). Thus it is possible that x, > 6 only for members of the finite set 
{x1,@2,...¢n}. 


7. (a) Algebraic manipulation gives 


== Qn -2|_ 10 
i 38n+5 3] 9n+15 
and 
10 af 10/e-15 _ 
9n +15 =z 9 


Thus, for given « > 0, the value N = (10/e — 15)/9 works in the definition. 
A formal proof resembles that in Example 2. 
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(b) 


(c) 


ll. (a) 


(b) 
13. (a) 


(b) 


(c) 


(d) 


Solutions 


The proof is like that for (a), except that now we have 


een 899900 wae 899900/e — 15 ses 
fe ~ 9n+15 = 9 

Thus N = (899900/e — 15)/9 works for any given € > 0. 

We’ ll show cy, — 0. Observe first that 

_ 2n Z 2n 2 

~ 3n2 +5 ~ 3n2— 3n’ 


len — L| 
and 


ee —=> chee 
3n 3¢€ : 


Here’s the formal proof: Let « > 0 be given; set N = 2. This N works, 
since ifn > N then 


_ 2n ge : 
~ 3n2 +5 ; 


ie — 
len 3n ~ 3N 
We can check explicitly that 1.511 + 86.5 < 89 = fi, and 1.5! & 
129.7 < 144 = fie. For the inductive step, note that 


fess 1S $15 15 95 S15) 1 S15, 


as desired. (Note that the proof worked because 1 + 1.5 > 1.5. A similar 
result can be shown to hold if 1.5 is replaced by any number 6b with 1 + b > 
b?.) 

Using technology one can check that n > 72 works. 


For any given € > 0, we see 


1 1 

jan -OJ=—<e = n>=. 

n € 

Thus the desired inequality holds for all n except those (finitely many) for 


which n < 1/e. 


Show that if x, — 0 in the sense above, then 7, — 0 in the “official”? sense 
defined in this section. Let « > 0 be given. By hypothesis, |, —0| > € holds 
for only finitely many n, and we can choose N to the largest such value of n. 
This N works in the ordinary definition. 


A sequence {xx } does not converge to 0 if, for some € > 0, there are infinitely 
many &p with |vn| > €. 


A sequence {x,} does not converge to 0 if, for some € > 0 and any number 
N, there is some xp with n > N and |rp| > €. 


15. Every constant sequence has such a table. 


17. If G has decimal expansion 0.didzd3dads..., then the increasing sequence 0.d1, 
0.did2, 0.didads3, 0.didzdzd4,... converges to (. 


Solutions 


2.2 Working with Sequences 


1. 


Here is a sketch. For any constant c 4 0, we have 


|can — ca| = |c||an —al <e lan —al < iq 


Now for any given e« > 0, we can choose N so that jan — al < €/|c| forn > N. 
(Why can we do this?) This N “works” for the given € and the original sequence 
{cay}. (Assemble the pieces into an efficient proof.) 


Suppose x, — 0. To show |rn| — 0, let € > 0 be given. Because x, — 0, we can 
choose N so that |v» — O| = |an| < € whenever n > N. The same N works for 
the sequence {|xn|}, because if n > N, then 


||an| — O| = |lan|] = lan] <e, 


as desired. 


(a) The lemma says that there exist both rational and irrational sequences that 
tend to J/2, and that these can be chosen to be monotone. 


(b) One possibility is pn = V2 + 1/n for all n. 


(c) One possible sequence comes from the infinite decimal approximation 
J2 = 1.414213562...; we can use r; = 1, ro = 1.4, r3 = 1.414, ete. 
(We need a little fix when a zero-digit comes along.) Another possibility, a 
bit less concrete, is to use the fact that every interval contains rational num- 
bers. 


The n = 1 (base) case is easy: s2 = 1.5 < 2 = 81. The inductive step is to show 
that if sn, — Sn—1 < 0, then s,41 — 8, < 0, too. Doing so involves careful but 
straightforward algebra. 


. It is clear that {hy} is increasing, and the inequality han > hn + 3 implies that 


{h,} is unbounded. Since hi = 1, the inequality means that hz > 1.5, ha > 2.0, 
hg > 2.5, heig99 > 501, ete. 


(a) Clearly, T;, < S, for all n, and both sequences appear to converge. 


(b) It is natural (and correct) to guess T;, = —"e The inductive proof is 
n 


straightforward. 
(c) We can use algebraic facts: T, a d d 
: SS SSS 05S eS 
. nail” i+ijfn. 140 


(d) Observe that 
1 ko 
mtn n+n2 Qn? 
for all n, and so S;, < 2T, < 2 for all n > 1. Thus {T7;,} is bounded above, 
and clearly increasing, so {T;, } converges. 


(a) Let M > 0 be given. Since zn, — oo, we can choose N so tn > M 
whenever n > N. But then also —x, < —M whenn > N. Thus —2y - 
—0o. 
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11. 


2.3 


Solutions 


(b) Suppose x, — oo. Let € > 0 be given; we need to find N son > N implies 


|1/an| =1/an < e. Well, if we set M = 1/e then (since x, — 00) we can 
choose N son > N implies zn > M = 1/e. But this implies 1/rn < e, as 
desired. 


(c) Suppose zn < 0 for all n. Then x, — —co if and only if 1/rm — 0. 


This follows from parts (a) and (b): tn oe) Di oo 
-l/a,r 3 0 <> 1/an — 0. 


(a) The limit is 2/5. 
(b) The sequence diverges to oo. 
(c) The sequence converges to 1/2. 


(d) The sequence diverges to oo. One strategy is to observe that 


n? + arctann n?—4 
ee SS SS 
n+2 n+2 


Subsequences 


(a) The converse says (falsely) that if {2,} has a convergent subsequence, then 


{xn} is bounded. Counterexamples are easy to find. The contrapositive says 
that if {a} has no convergent subsequence, then {x, } is unbounded. 


(b) The BWT says that {y,} has a convergent subsequence. The BWT also ap- 


plies to {zn}, but isn’t really needed, since it is easy to see that z, — 0. 
Thus, every subsequence of {z, } converges to zero. 


(c) If {an} is bounded, then the BWT guarantees that a convergent subsequence 


exists. If {a, } is unbounded, then Lemma 2.13 applies. Both can occur, as 
they do in the sequence 1, 0, 2, 0, 3, 0, dots. 


(a) Given a sequence that lists the rationals, we can just form the subsequence of 


nonnegative rationals. 


(b) Look at any two successive terms, say pi; and p2. Between these two rational 


numbers like other rationals. If the sequence were monotone, these other 
rationals would lie between p; and pe in the sequence. 


(c) We can choose 71 so that pn, = 1. Then we choose nz with n2 > n1 and 


Pn = 2. Continuing this process completes the proof; details are left to the 
reader. 


(a) Substituting n = 2k gives v2, = 2k/(2n+ 1), which converges to one as 


k — oo. Substituting n = 2k — 1 gives vo,-1 = —(2k — 1)/2k, which 
converges to —1. With two different subsequence limits, {x,, } must diverge. 


(b) Consider any convergent subsequence %n,, Lng, Lng, Lng, ---, With limit 


L. Since this sequence has infinitely many terms, it must include either in- 
finitely many even-indexed or infinitely many odd-indexed terms from the 
original sequence {xp }. If there are, say, infinitely many even-indexed terms, 


Solutions 


11. 


then they form a new subsequence, which obviously tends to one. But every 
subsequence of a convergent sequence tends to the same limit, so we must 
have L = 1. 


Since {x,,} diverges, xo is not the limit. Negating the definition of convergence to 
Xo means that, for some € > 0, no N works in the definition of convergence. We’ll 
use this to construct a sequence. 

Because N = 1 doesn’t work in the definition, there is some term—call it x,,, —for 
which |an, — 2o| > €. Now consider N = nj. But this N doesn’t work either, 
so there must be some term—call it x,,—for which nz > ni and |Eno - xo| >e. 
Continuing this process indefinitely gives the desired subsequence: ny < no < 
ng <... and |%n, — Xo| > € forall k. 


If x» — 2o, then we know that every subsequence, and hence every monotone 
subsequence, must also converge to xo. 

The other implication is that if every monotone subsequence converges to xo, then 
Ln — Xo, too. We prove this by contradiction. 

If x» does not converge to xo, then for some € > 0 there is a subsequence {xn, } 
with |Cn, —2xo| > e€ for all k. By Proposition 2.12, there is a monotone subse- 
quence {Yyn, } of {an,}. Now |an, —2o0| > € holds for all the z,,, and so it 
certainly holds for all the yn,, too. In particular, yn, —> Xo is impossible, which 
contradicts our assumption about monotone sequences. 


Both parts follow from unraveling the e—N definitions. 


2.4 Cauchy Sequences 


1. 


(a) For given € > 0 wecan choose N = 2/e. This N works because if m,n > N 
then |%n —2m| < |tn|+|em| =++4< $+ § =e. 

(b) The sequence is not Cauchy. If we choose, say « = 0.001, then no N works: 
if N is any positive number, no matter how big, then with m = N + 1 and 
n= N + 2we have |yn — ym| = 2/1234 > 0.001. 


(c) The sequence is Cauchy. Note that ifn > m, we have 


n m n-—m 
|2n — 2m] = | aa ~ (nt+1(m+1) 
es Ss 
nm nm om 
It follows that, for « > 0, we can choose N = 1/e. 
(d) The sequence is Cauchy. If n > m, we have 
sinn sinm sinn sinm 
eS Wel eee ogee peed | eed 
2 


< 


m1 ~~ mi? 


and 2/m? < cifm > \/2/e. Thus, N = \/2/e works in the definition. 
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2.5 


Solutions 


For € > O choose N that works for € in the sense of Definition 2.16. This same N 
works in the definition of convergence to zero. To see why, let m > N be given. 
Then choose any n of the form n = 10*, withn > m. (This can certainly be done; 
n = 10” is one possibility.) Then we haven > m > N, and so |am — 2n| = 
|Lm — O| < e, as desired. 


(a) Any sequence of rationals tending to an irrational will do. 
(b) Yes. The only Cauchy sequences of integers are eventually constant. 


(c) [0, 1] is complete because if {x,,} is any Cauchy sequence in [0, 1], then {x,, } 
tends to some limit L. Because 0 < a, < 1 for all n, L lies in (0, 1], too. 
(0, 1] is not complete; {1/n} is a Cauchy sequence in (0, 1], but its limit, 
zero, lies outside (0, 1]. 
Let « > 0 be given and set «’ = €/42. Because {ay} is Cauchy sequence there 
is some N such that |an — a@m| < ¢’ whenever n > N. This N works for the 
sequence {xp }; details left to you. 


Two numbers with the same first n decimal places can differ by no more than 107”. 


. The key point is that, regardless of n, 4+ + ster +++ + oe < gk. This fact 


can be parlayed into a proof. 


Series 101: Basic Ideas 


(a) S;, = 0 for all n, and so the series converges to 0. 
(b) S, = 42n for all n; this diverges (to 00). 


(c) The partial sum sequence {.5,,} has the form —1, 0, —1, 0, ...; this di- 
verges. 


(d) Sn = (n(n + 1))/2 for all n; this diverges (to 00). 


(ec) The series is geometric, with r = 0.99, so S, = (1—r")/(l—-r) = 
100(1 — 0.99”), and we see that S;, — 100. 


(f) It is easy to show by induction that S, = n/(n +1). Thus S, — 1. 


= - oe ye 

(a) kar converges by comparison to the geometric series be OR’ 
k=1 k=1 

(b) > i diverges by comparison to the divergent series > oe (Con- 
Le ees ee 2 2+ 3k 


vince yourself that the comparison works.) 


So. k 


co 
F , , 2 
(c) s 38 converges by comparison to the convergent geometric series s 3K 
k=1 


k=1 
(d) se ke +2 diverges by the nth term test: lim k* +2 = #0 
2x 3h2 Hg OES OY "koe BRP 4 BT 


Solutions 


5. Let S> ay, and 5~ by be series, and let S~ cz, be the “sum series,” defined by cy, = 
an + bx for all k. Let An, Bn, and C;,, denote the partial sums for these series. 
The key point is that—thanks to the commutative law for addition of finitely many 
numbers—C’,,, = A, + By for all n. It follows that if A, — A and B, — B, then 
we must also have C;, — A+ B, which is what we wanted to prove. 


(a) It is enough to show (i) ifm > 1000 then S,, > S000; (ii) if n > 1000 then 


de 


2.6 


(b) 


(c) 


(a) 


(b) 


Sn < Si001. Claim (i) amounts to observing that every string of the form 


1 1 et Ml 
1001 1002 " 1003 ~ 1000 +n 


adds up to a positive result; group summands in pairs to see why. Claim (1i) 
holds because every string of the form 


1 1 1 al 


~Jo02 1003 ~ 100d * °° * I000 Fn 


has negative sum; again, group summands in pairs to see why. 


For given « > 0 we can take any integer N > 1/e. Then |Sn — Sw4i| = 
1 

Net = & 

For given € > 0 choose WN as in the preceding part. Then forn > m > N, 


we have both S;, and S,, between Sw and Sv +1, and thus within € of each 
other. 


Let A, and B,, be the partial sums for )~ a;, and )~ b;. The hypothesis boils 
down to the fact that for n > 42 we have B, = An — Aag + Bag. Thus, the 
sequences {A,} and {B,} differ (for large n) by an additive constant, and 
so both converge or both diverge. 


We know (see the preceding part) that B, = An — Aso + Baz. Because 
be — ax = k?, we must have Baz — Asa = paar 2k? = 25585. Thus, 
Bn = An + 25585, and since A, — 100 we must have B, — 25685. 


Series 102: Testing for Convergence 
and Estimating Limits 


(a) 
(b) 
(c) 


(d) 
(a) 
(b) 


yo, 1/(k + 2") converges by comparison to 77°, xr: 

yx, k/(2k? — 1) diverges; one approach is limit comparison to 77, i: 
as + converges by comparison or limit comparison with the geometric 
series 0°, zs 

yr (k? + 2)/(3k? + 4) diverges; use the nth term test. 


y%., 100 /k! converges by the ratio test. 


yo, sink/ 1.0001” converges absolutely. The absolute value series con- 
verges by comparison to the geometric series )77-_, 1/ 1.0001". 


273 


274 


11. 


3.1 


1. 


Solutions 


(c) 3, (k? +k 4+ 1)/(k? +k? + k + 1) diverges; one approach is limit com- 
parison to 0°, ¢- 


(d) 3%, (-1)*/(InIn(k + 2)) converges by the alternating series test. 


(a) The ratio az /Qy = Gx tends to zero (by the kth term test), and so >> az 
converges. 

(b) If San = > Tr then S> ag diverges. If Sy ax = > G, then Saf con- 
verges. 


If S~ by converges to B, then }~* ax converges to B — Sear be + Sia az. This 
is similar to a problem in the preceding section, where K = 42. 
Note that the sum 
1 1 1 ‘L. 1 n 
Sn =—mtometee et . = 


ie Va” Va 
Since \/n — 00, we must have S$’, — 00, too. 
(a) The series converges by comparison to 57> 7°, 1/ 2*—1 which converges to 2. 
(b) We have Rio = S322, 1/(2*71 +1) < 2, 1/2"? = 1/2?. 
(c) We have S = S39 + Rio < 1.26255 + = ~ 1.26450. This means that S 
lies somewhere in the interval [1.26255, 1, 26450]. 


(a) See your favorite calculus text. 


(b) We need to choose n so Rn < 0.001. By the given inequality, this holds if 
n is large enough so that i a dx/x* < 0.001. A calculus calculation shows 
that [°° dx/x* = 1/2n7, and the last quantity is less than 0.001 if n > 23. 
This means that S23 ~ 1.2012 is within 0.001 of the true answer. 


Limits of Functions 


Both parts are easy exercises with the definition. In (a) we can choose any 6 for 
given ¢. In (b) we can choose 6 = €. 
(a) For f(x) = x, we have limz_+42 f(x) = limz—42 x? = 42? = f (42), so 
the condition does hold at a = 42. 


(b) For f(x) = x? and any input a, we have limz—a f(z) = limz—a ee = 
a” = f(a), so the condition holds at every a. 

(c) Since the function f(z) = (a? — 4)/(a — 2) is not defined at a = 2, the 
condition can’t possibly hold. 

(d) We have limz—3 f(2) = limz3(x? — 4)/(a — 2) = (3? — 4)/(3—1) = 
f (3), as desired. 


(e) Note that a = Ois an endpoint of the domain. Also, we know lim,_.9+ f(x) = 
lim,_.9+ V/& = 0 = f(0), so the condition holds in the one-sided sense. 


Solutions 


(f) We have limz—a f(x) = limz—a |x| = |a| = f(a) for all a. 


(g) Theorem 3.3, Page 138, assures that limz—a f(x) = limz—a(1 +orrten? + 
nex*) =1+4+2a+ ea? + rea® = f(a), so the condition holds for all a. 


5. Suppose that lim,—a f(x) = L, that xn — a, and that x, 4 a for all n. To see 
that f(an) — L, let e > 0 be given. Because limz—.2 f(a) = L, we can choose 
5 > Oso that | f(a) — L| < € whenever 0 < |x — L| < 6. Because x, — a and 
6 > 0, we can choose N so that |z, — al < 6 whenever n > N. 


This N works to show f(an) — L, since ifn > N, then we have |x, — a| < 6, 
and so so |f (an) — L| < €, as desired. 


7. (a) We say limz—. f(x) = L if for all « > 0 there exists M < 0 such that 
| f(x) — L| < e whenever x < M. 


(b) We say lim, _,9- g(z) = L if for all € > 0 there exists 6 > O such that 
|g(a) — L| < € whenever —6 < x < 0. 


(c) Following is the idea of a proof that limz.—. f(x) = L implies that 
lim,_.9—- f(1/x) = L. The converse is proved similarly. 
Let « > 0 be given. We want to show that | f(1/x) — L| < € for x in some 
interval (6,0). By hypothesis, there is some M < 0 so that | f(t) — L| < « 
when t < M. Writing t = 1/z, this means that |f(1/2) — L| < ¢ when 
1/x < M < 0, or, equivalently, when 0 > x > 1/M. (The inequality 
algebra is a bit tricky because both M and z are negative.) This implies that 
we can take —6 = 1/M, or, equivalently, 6 = —1/M. 


8. (c) To show that the squeezing inequality holds for all « > 0, use the “conjugate 
trick”: 


eo (/1 + 2/2 —1)4/1 + 2/2 +1) 
er Pree es 


2/a 2 


= — < -. 
J1l+2/e+1 « 


9. (a) Wecanuse 6 = 0.0024 (or less). Note 6 © €/4. 
(b) We can use 6 = 0.00005 (or less). Note 6 © €/200. 


(c) With « = 0.01, the value 6 = 0.001 does not quite work at a = 10. (The 
value 6 = 0.0005 does work.) 


11. The proof for a € Z is the same as in the preceding problem. If a ¢ Z, then we can 
take 6 to be the (positive!) distance from a to the nearest integer. 


13. What matters is that for the finite set S there is a smallest distance, say d, between 
any two points in S. To show limz—.a j(x) = 0 when a € S, we see that 6 = d 
works for any € > 0. 


To show limz—.2 j(x) = 0 when a ¢ S, we can choose any € > 0 and let 6 be the 
smallest distance from a to any point of S. 
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3.2 


Solutions 


Continuous Functions 


1. Suppose first that A 4 0. Let € any positive number. Let 6 = €/| A]; note that 6 > 0. 


If |z—c| < 6, then | f(x) — f(c)| = |Av+B—(Ac+B)| = |A||x—c] < |Alé =e. 
If A = 0 then f is a constant function. In this case any number 6 > 0 works. 


(a) For any « > 0 we can choose 6 = € in the definition, since if |z—a| < 6 =e, 
then | f(a) — f(a)| = ||a|—|a]| < |z-—a| < ©. (We used the reverse 
triangle inequality.) 


(b) Use (a) and Proposition 3.12. 
(a) One approach is to note that lim,_,9+ sign(z) = 1 but lim,_,9— sign(a#) = 
—11. This implies that the ordinary limit cannot exist. 


(b) Ifa # 0, we have sign(x) = sign(a) for x in some interval about a, and so it 
follows that limz_.2 sign() = sign(a). 


7. Note that lim, 3 f(a) = f(3). Let e = f(3) — 5. If we choose any 6 > 0 that 


9. 


3.3 


2. 


works for this €, we’re done. 


(a) The discontinuity of f(x) = 1/x at a = 0 is not removable; limz—.o f(x) 
does not exist. 


(b) The discontinuity is removed if we set f(—2) = —4 = limz—-2 f(z). 
(c) The discontinuity is removed if we set f(0) = 0 = limz—o f(z). 


Why Continuity Matters: Value Theorems 


(a) With 6 = 1, we can use M = 43°, since |f(x)| = |x? | < 43° when 
x € (41, 43). (Other values of 6 and M are possible, too.) 

(b) Let « = 1. Since f is continuous at c there exists 6 > 0 such that x © 
(c — 6,c + 6) implies f(x) € (f(c) — 1, f(c) + 1). In particular, for these 
x we have |f(x)| < |f(c)| + 1, so we can take M = |f(c)| +1. 


_ jl ifzgeQ@ 
wey={ ifa gQ. 


This example works because f is not continuous anywhere. 


(d) One possibility is the function defined by f(x) = 1/zx if x # O and f(0) = 
42. 


(c) One possibility is 


(a) First write 


Now, as x tends to infinity, the quantity in parentheses tends to one and so the 
product tends to oo. 


Solutions 


~] 


11. 


(a) p(x) = x(x? + 1)? is one possibility. 

(b) p(x) = x(a — 1)(x — 2) is one possibility. 

(c) p(x) = (2? + 1)?! is one possibility. 

(d) p(x) = «(a — 1)(x — 2)--- (a — 40) is one possibility. 


(a) The minimum value of p on [0,3] is p(0) = 0. The EVT guarantees that p 
must assume such a minimum value. 


(b) The minimum value of g on (0, 3] is g(3) = 0. The EVT doesn’t apply here, 
since (0, 3] is not a closed interval. 


(c) Factoring gives p(a) — 4 = a(3 — a)? — 4 = (a — 4)(a — 1). This shows 
that p(1) = 4 and p(a) — 4 < 0 when 0 < a < 3. In other words, p(1) is the 
maximum value. 


. The graph of f is an upward-opening parabola with vertex at x = 1. Thus f(z) 
takes the minimum value f(1) = —1 at 2 = 1. No maximum value is achieved on 


(0, 3). But note that f(—1) = 3 is the supremum of the set { f(x) | « € (0,3)}. 
(a) f(x) = |2x — 1| is one possibility. 
(b) f(x) = (sin(27a) + 1)/2 is one possibility. 


(c) No such function exists. A continuous function on [0, 1] must achieve a min- 
imum value. 


(a) One approach is to modify the proof given for the PoPI. Another approach is 
to define g(a) = — f(x), which is also continuous at z = c. Now f(x) > K 
is equivalent to g(x) < —K. Applying the (already known) PoPILTV to g 
gives the desired result. 


(b) We have g(0) = 0 < 1, but there is no two-sided input interval (—6, 6) on 
which g(x) < 1. Thus g fails the PoPIGTV. On the other hand, if we have 
g(0) =0 > K, then g(x) > K for all x, so g satisfies the PoPILTV. 


(c) The function h satisfies the PoPILTV but fails the PoPIGTV (consider K = 
0.5, for instance). 


(d) The “only if” part has already been shown. For the “if” part, let € > 0 be 
given. Then f(c) < f(c) + 6 and so the PoPIGTV gives 5; > 0 such that 
f(x) < f(c) + efor x € (c— 61,c + 61). By the PoPILTV there is 62 > 0 
such that f(a) > f(c)—efor x € (c—62,c+62). Setting 6 to be the smaller 
of 6; and 62 completes the proof. 


13. Suppose toward contradiction that f is not strictly monotone. Then there exist 


points a, b, and c in J with a < b < c buteither (i) f(a) < f(b) and f(b) > f(c); 
or (ii) f(a) > f(b) and f(b) < f(c). Both (i) and (ii) lead to contradictions. In 
case (i), let v be any number such that both f(a) < v < f(b) and f(b) > uv > f(c). 
By the IVT we have f(d) = v for some d between a and 6, and also f(e) = v for 
some e between b and c. This contradicts the assumption that f is one-to-one. 
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3.4 


Solutions 


Uniform Continuity 


(a) 


(b) 


(c) 


(d) 
(e) 


(a) 


(b) 


(a) 
(b) 


(c) 


(d) 


(a) 


Let 5 = /2 — 1. This works for € = 1 at c = 1 because if | — 1| < 6, then 
|2? 1) =|2-1||2+1]< (v2-1) (v2+1) = 


Let 6 = V101 — 10. This works for « = 1 at c = 10 because if |x — 1| < 6, 
then 


|z? — 10?| = |x — 10||x + 10| < (vi01 — 10) (v101 + 10) 0 


Values of 6 that work for atc = 1, c = 10, c = 100, and c = 1000 are 
6 © 0.414, 6 & 0.05, 6 = 0.005, and 6 & 0.0005, respectively. The fact that 
these 6’s decrease to zero means they can’t be chosen “uniformly” in R. 

The value 6 = 0.1 works for g(a) = 10x + 3 and € = 1 at all values of c. 


The value 6 = 0.0005 works for « = 1 on the entire interval [1, 1000]. 
(It doesn’t work on a larger interval.) 
Note that if a and 6 are in [—3, 42), then Ja” + ab + 6?| < 3-42? = 5292. 
The hint implies, therefore, that | f(s) — f(t)| = |s* — t?| = 5292|s — ¢]. 
This implies that, for any « > 0, the value 6 = €/5292 works in the definition 
of uniform convergence on [—3, 42). 


The function f(z) = 2x is continuous on the closed and bounded interval 
[—3, 42], and is therefore also uniformly continuous on the same interval, by 
Theorem 3.22. Since [—3, 42) C [—3, 42], f is also uniformly continuous on 
the smaller interval. 


Mimic Example 1. 
For x, y € [1, 00), we have 
Lf) — sep] = WE val =| aE < be. 
Vit Vi 
This implies that, for any given € > 0, the value 6 = € works. 
The sequence 4, x, x, oo ... is Cauchy, but applying the function h gives 
the sequence 1, —1, 1, —1,..., which is not Cauchy. 


If we set k(O) = 0, then & becomes continuous on the closed interval [0, 1], 
and so Theorem 3.22 applies. 


For given € > 0, choose 6; and 62 such that, for x,y € I, 
€ 
ln—vl <b => [F(@) — F< & 

€ 

|x —y| <d2 => |9(z) — gy) < 5- 


Then 6 = min(61, 62) works for f + g, because if x,y € I and |x — y| < 6, 
then 


I[f(x) + 9(@)] — [F@) + 9@)ll S$ IF(@) — FO) + la@) — 9) 


Solutions 


11. 


4.1 


(b) One of many possibilities is f(z) = x = g(x) on J =R. 


(c) We know from earlier work that the product fg is continuous on I; because I 
is closed and bounded, fg must also be uniformly continuous. 


(d) Suppose |f(x)| < My and |g(x)| < M, for x € I. The first step is a flashy 


calculation: 
lf(x)o(x) — Fya)| = If(w)9(x) — f(y)a(2) + F(y)a(x) — Fay) adding and subtracting 
< |f(w)g(x) — f(ya(x)| + |F)a(x) — Fay) triangle inequality 
= loa) F(x) — FY) + IF MI I9(@) — 9(y)| 
< My |f(x) — f(y)| + Me lo(z) — ay) using the bounds 


From here on we can mimic the proof in (a). 
(a) The function f must be constant (and hence uniformly continuous) on R. 


(b) The function f need only be bounded (and not necessarily uniformly contin- 
uous) on R. 


(c) The function f must be constant (and hence uniformly continuous) on R. 


(d) For some 6, values of f can’t rise or fall by more than | on intervals of 
length 6. Such a function could have jumps, and so need not even be con- 
tinuous. 


For given € > 0, set 6 = €?. This 6 works, since if 0 < x — y <6, then 


Ife) — fl =Vt- Vas Ve-y< Ve =e, 


as desired. 


Defining the Derivative 


The difference quotients for g’(a) and for f’(a) turn out to be identical, so their 
limits must be equal. 


(a) The difference quotient (f(a) — f(0))/(a — 0) is nonnegative for « > 0, so 
the right-hand limit is nonnegative. Since the ordinary limit exists, it must be 
nonnegative. 

(b) We have f’(0) = lim,_.o-(f(x) — f(0))/(@ — 0). Since the difference 
quotient is nonpositive, so is the limit. 


(c) If f has a minimum value at x = 0, then f’(0) = 0. 


. Definition 4.1 gives 


(f9)'(0) = tim LOS) — timn g(x) £7) = g(a) p'(0) =6. 


x20 x x—O0 x 


The limit-of-difference-quotient calculation for g’(2) is like that for f, but for a 
factor of 4. 
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Solutions 


10. The difference quotient calculation for g’(—2) boils down to the same thing as the 
difference quotient calculation for f’(2), so both derivatives have the same value. 
This makes sense geometrically, as the graph of g is obtained from the graph of f 
by translating four units left and raising five units up. 


11. (a) The easy one-sided limits lim,_,,- f(x) = 1 = lim,_,,+ f(x) imply 


lim,—1 f(z) = 1 = f(1), and so f is continuous at x = 1. The limit 
that defines f’(1) does not exist because the corresponding one-sided limits 
are unequal: 
5g (O=ID 5 tee 3g FOLIO _s 
x—1— xz—1 ait a—-l 


(b) One possibility is to set f(a) = 2” — 1; then f’(1) = 2. 


13. (a) Note that g(0) = 0; we need to show that limz_.9 g(x) = g(0) = 0. To do 
this, first choose M > 0 so that | f(a2)| < M for all x € (—1,1). Then we 
have 

0 < |g(x)| = lal [f@)| <M Ia. 


Since the right-most quantity tends to zero as x tends to zero, so must g(x). 


(b) Note that h(0) = 0 and that h(x) = xg(x). By definition, 


h'(0) = lim we) = lim g(x) = 0, 


z-0 @ xz—0 


as shown just above. 


4.2 Calculating Derivatives 


1. One can show that x” — a” = (a — a) (a”~' + ax"? +--- 4+ a”~"); there are 
nm summands. Taking the limit as x — a gives na” — 1 as desired. 
2. (a) By the ordinary product rule, (fgh)’ = (f (gh))’ = f' (gh) + f (gh) = 
f'gh+f(g'h+gh')= f'gh+ fg'h+ fgh'—a nice pattern. 
(b) Iterating the usual chain rule gives (fo goh)'(a) = f'((go h)(a)) - 
(go h)'(a) = f’(g(h(a) ) - g'(h(a)) - h’(a). Here is an even more compact 
version, omitting the “argument” a: ( fogoh)’ = (f’ 0 (goh))-(g' oh)-h’. 


(c) Finding the second derivative involves both the chain (twice) and the product 
ules: (f 09)" (a) = (f’(g(a)) - 9'(a))’ = (f'(g(a)))'-9' (a) + F'(9(a))- 
g(a) = f"(9(a))- 9'(a) + 9'(a) + f'(g(@)) - 9" (a). 

(d) Using first the chain rule and then the product rule gives ( f o (gh) )' (a) = 
f'(gh(a)) « (gh)'(a) = f'(gh(a)) « (gh!(a) + 9'h(a) ), of, alternatively, 
f'(g(a)h(a)) - g(a)h'(a) + f"(g(a)h(a)) - g'(a)h(a). Leaving out the ar- 
guments give a more cryptic form: ( f o (gh) )’ = f' 0 (gh)- gh’ +(f’ 
(gh) )-g'h. 
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(e) If f(x) = Ax + B and g is any function, then (f og)'(a) = Ag(x) + B. To 
find the derivative (it’s Ag’(x)), we can use either the chain rule or the linear 
combination rule. 


The derivative h’ (a) is defined as a limit, which we can evaluate using the definition 
of h and algebraic properties of limits: 


vy Ma) = Wa) _ 5, 3F (x) = 429(a) = 3f(a) + 429(@) 


xr—a xZ—a @r—a x-a 
~ 3 tim LO=FO _ 45 tm 9-91) 
ra rwZ—a La L—a 


= 3f'(a) — 429'(a). 


7. All parts are straightforward but perhaps messy calculations. 


4.3 


Clearly, g(a) = f(a)? = 0. The chain rule also gives g'(a) = 2f(a)f’(a) = 0. 


(a) That (f o g)’(n) = 0 for all integers n follows immediately from the chain 
tule. 


(b) The product rule gives h’(2) = —sin (rx) - 7 - g(x) + cos(mx) - g’(x); 
this is clearly zero for each integer n. 


The Mean Value Theorem 


. If a and b are any two roots of f, then Rolle’s theorem implies that f’(c) = 0 for 


some c between a and 6. Our claim follows. 
If f(z) = 2+sin a, then f has no real roots, but f’(x) = cos x has infinitely many 
roots. 


(a) Note that p(x) = 2° + « = x(x? + 1). The first factor is zero at 2 = 0 and 
the second is always positive. 


(b) If p had two roots a and 6, then, by Rolle’s theorem, there would be some c 
with p’(c) = 0. But p’(2) = 3x? + 1, which is always positive. 
(a) Every odd-degree polynomial has at least one root; we’ve shown this using 


the intermediate value theorem. 


(b) There are many possibilities. The functions fi (2) = x? (witha = b = 0), 
fo(a) = x — 3a + 2 (with a = —3 and b = 2), and f3(x) = a? — 3241 
(with a = —3 and b = 1) have one, two, and three roots, respectively. 

(c) Rolle’s theorem implies that between any two distinct roots of f lies a root 


of f’. If f had at least four roots, f’ (a quadratic polynomial) would have at 
least three roots, which is impossible. 


(d) We know f has at least one root. If a > 0, then f’(x) = 3a? + a, so 
f' (x) > 0 for all x. Thus f’ has no roots, and so f can have at most one root. 


(a) Rolle’s theorem gives x; between a and c and x2 between c and b. 


(b) Rolle’s theorem applied to f’ on the interval [21,22] gives the desired xo. 
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(c) If f : [a,c] > R has continuous derivatives f’, f”,..., f¢? on [a,c], and 
there are inputs 71, v2, ..., 43 with f(a1) = f(v2) =--- = f(xa3), then 
the desired xo exists. 


7. Rolle’s theorem implies that f’ has a root in each interval (0, a), (a, 2a), (2a, 3a), 
.... Since f’ is also periodic with period a, (f’)’ = f” also has roots in each of 
these intervals, and similarly for higher-order derivatives. 


8. (a) By the MVT we have 


f(0) — f(a) 


= =f'() >M 


for all a and b. If b > a we can multiply both sides by (b — a) to get the 
desired inequality. 


9. (a) Every number c in (1, 7) works. 
(b) The only solution is c = 4. 
(c) In this case, the MVT equation f’(c) = (f(b) — f(a))/(b — a) reduces to 
100c°® = 1, which gives c = 1/100'/9° = 0.955. 
(d) In this case, the equation f’(c) = (f(b) — f(a))/(b—a) reduces to 6¢+5 = 
3a + 3b + 5, which gives c = (a + b)/2, the midpoint of (a, b). 


(e) If q(x) = Ax? + Bax + C and [a,b] is any interval, then the mean value 
equation holds if and only if c = (a + b)/2. To see why, note that in this 
case the equation f’(c) = (f(b) — f(a))(b — a) reduces to B + 2Ac = 
B+ A(a+b). 


11. Ifacarhas the same velocity at times a and b, then there must be some intermediate 
time c at which the acceleration is zero. 


13. (a) Consider h(x) = f(x) — g(x). By hypothesis, h’(x) = 0 for all x € (a,b). 
By Proposition 4.11, h is constant, as desired. 
(b) Let h(x) = f(x) — g(x) — 5x. Then h’(x) = 0 for all x € (a,b), and so 
h(a) = C for some constant C’. It follows that f(a) = g(#) + 5a 4+ C. 
15. If we write m = (f(b) — f(a))/(b — a), then L(x) = m(ax — a) + f(a), and the 
rest follows. 
17. (a) Foreach x > 0, we have, by the MVT, 
f(x) — F(0) _ f(x) 


xz—0 x 
for some c € (0,2). Since f’(c) < 1, we get f(x)/x < 1, which implies 
f(a) <a. 

(b) For x < 0 we still have, by the MVT, 
fe) =f) _ f@) 


1. 
xz—0 x = 


Again we multiply both sides by x. Since now x < 0, this implies f(x) > a. 


Solutions 


19. Suppose f(b) > 6 for some b > 0. Then the MVT implies that 


21. 


4.4 


f(¢) -fO=10 _ 10 5 , 


b—0 b 


for some c between 0 and 5b; this contradicts our assumption. 


(a) 


(b) 


(c) 


Let « > 0 be given; set 6 = ec. If s and t are in D and |s — t| < 6, then 
f(s) — f(®)| < |s —t| < 6 =e. Thus f is uniformly continuous on D. 

Let x and y be any numbers. By the MVT, we have (it’s OK to take absolute 
values) | f(x) — f(y)| = If] |e — yl < le — yl. 

The function f(z) = sin x is a contraction, since | f’(a)| = |cosa| < 1. The 


function g(x) is not a contraction. One reason is that g(1) — g(0) =e-1% 
1.72 >1-—0. 


Sequences of Functions 


(a) 


(b) 


(c) 


(a) 


(b) 


For any real number x and « > 0, let N = 1/e. Ifn > N, then 


lin(2) — f(2)| = fale) == <e 


Thus, fn — f pointwise on R. Because the same inequality holds for all x, 
the convergence is also uniform. 


For any fixed real number x we have f,(x) = «/n — 0 asn — oo, so 
fn — f pointwise on R. The convergence is not uniform on R, however. 
For « = 1, for instance, no N works because, for every n, we have | fn (x) — 
f(x)| =|a/n| > 1 for some x € R. 


For any fixed real number x, we have f,(a) = sin(a)/n — 0asn — oo. 
Thus, f, — f pointwise on R. The convergence is also uniform; the proof is 
similar to that in (a). 


We need to show that 


. n x . 
lim ——~= lim 
x2—0+ £ x2—0- x 


For the right-hand limit we have 
giti/n 


; d 1 
lim = lim 2!/"=0, 
x—0t x a—Ot+ 


as desired, and the left-hand limit is similar. 


For 0 < x < 1, we have |f(x) — fn(x)| = a — a'41/". With g(x) = 
a —a'+1/" elementary calculus methods show g(a) < =forO<a@<1.In 
other words, 


IF (e) ~ fao)| < = 


holds if 0 < x < 1; the same inequality holds for —1 < x < 0 by symmetry 
(both f and f, are even functions). 
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7. (a) For any fixed x, x + 1/n — x as n — ov. Since f is continuous at x we 
must have f(x + 1/n) — f(a), as desired. 


(b) If f(x) = 2”, then 


faa) = (@+1/n)? =a? + 245. 


n 


In this case f,, — f pointwise, but not uniformly. 


9. Let € > 0 be given. Since h is uniformly continuous on R, we can choose 6 > 0 
such that |h(s) — h(t)| < € whenever |s — t| < 6. Because f, — f uniformly on 
I, we can choose N such that | fn (a) — f(x)| < 6 holds for all 2 € J whenever 
n > N. By our choice of 6, this guarantees that |h (fn(x)) —h (f(x))| < € for all 
x € I, as desired. 


11. The result is just a restatement of the analogous fact for Cauchy sequences of num- 
bers. 


5.1. The Riemann Integral: Definition 
and Examples 


1. (a) Calculation gives RS(f,P,S) = 0-0.8+1-0.2+2-0.74+3-03+4- 
0.9+9-0.1 = 7.0; ||P|| is 0.9, the largest width. 


(b) To make RS(f,P,S) as small as possible, choose left endpoint samples: 
S = {0,0.8, 1.0, 1.7,2,2.9}. This gives RS(f,P,S) = 0- 0.8 + 0.8? - 
0.24+1-0.7+1.77 -0.3+4-0.9 + 2.97 -0.1 = 6.136. 


(c) Tomake RS(f, P, S) as bad as possible, choose right endpoint samples: S = 
{0.8, 1.0, 1.7, 2, 2.9, 3}. This gives RS(f, P,S) = 0.87 -0.8+12-0.2+ 
1.7? -0.7+ 27 -0.3 + 2.97 -0.9 + 37 - 0.1 = 12.404, an error of 3.04. 


(d) The sample set S = {./3} gives RS(f,P,S) =3-3=9. 
2. (a) Sampling at midpoints gives RS(f,P,S) = 8. 

(b) Samples S = {0, 2,3,4} give RS(f,P,S) = 9. 

(c) If P = {0,4} and S = {0} then RS(f,P,S) = 0. 

(d) For P = {0, 4} find a sample set S so that RS(f,P,S) = 8. 


(ec) No two-piece partition P gives RS(f,P,S) = 0; the second summand has 
to be positive. 


(f) Choose all partition points except the last one very near zero. 


3. The fact that f(a) > 3 for all x € [0,2] implies that RS(f,P,S) > 6 for every 
partition P and sample points S. This implies, as in the proof of Proposition 5.3, 
that the integral itself cannot be less than 6. 
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7. Notice first that the inequality 0 < f(a) < g(a) implies that 0 < RS(f,P,S) < 
RS(g,P,S) holds for all suitable partitions P and sample sets S. To show f is 
integrable on |a, b], let € > 0 be given. Since g is integrable, there is some 6 > 0 
that works for g (and the value J = 0) in Definition 5.1. The preceding inequality 
shows that the same 6 > 0 works for f. 

9. Let € > 0 be given. Since J; satisfies the definition, we can choose 6; > 0 that 
works for J; and e. Similarly, since [2 satisfies the definition, we can choose 62 > 0 
that works for [2 and €. Let 6 be the smaller of 6; and 62, and calculate any Riemann 
sum RS(f,P,S) for ||P|| < 6. Then both 7; and Jz are within « of RS(f,P,S), 
and so within 2e of each other. Since this holds for all positive «, we must have 
Ty = Ip. 

11. (a) The condition holds by uniform continuity of f on [a, b]. 
(b) Note that 


IRS(f, PS) — RS(F,P,T)| =|9_ (F(se) — f(s) Aas 


< Se Az; = «(b—a). 
i=1 
12. (a) Look at individual summands of RS(f,P, M). For each i, we have 


pean 2 _ 42 
f(m) Agi = SOS (as ee oe at 
Adding these up gives a telescoping sum, with value (7? — 23)/2 = 1/2. 
13. All the calculations work the same as in Problem 12, except that we need 6 = 


e/(b—a). 


5.2 Properties of the Integral 


1. (a) The function h is integrable because it is the difference of two integrable 
functions. 


(b) Inthe situation of Corollary 5.6, we can set h(x) = g(x)—f (a). Theorem 5.5 


says that h is integrable and that ae g- pr f= ih h. By (a) the last quantity 
is nonnegative, which is what we wanted to show. 


3. The result of Problem 2 implies that the integral lies between —6 and 12. 


5. (a) By the extreme value theorem, f assumes a minimum value, say f(c) = m, 
for some c € [a, b]. By hypothesis, m > 0. Now f(a) > m forall x € [a, 8], 
and so Corollary 5.6 gives 


[rz [m=mo-a>0, 


as claimed. 
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11. 
13. 


5.3 


Solutions 


(b) Part (a) implies that f(c) < 0 must hold for some c € [a,b]. For similar 
reasons, f(d) > 0 must hold for some d € [a,}]. If either f(c) = 0 or 
f(d) = 0, we’re done; otherwise, the intermediate value theorem says that 
f(e) = 0 for some e between c and d. 
(c) Let f(x) = 1 for « > Oand f(x) = 1 for x < 0. Then JF = 0, but 
f(c) £0 for all c. 
The claim amounts to saying that — Hie fl < As f< fs |f|. This follows from 
Corollary 5.6, and from the fact that — | f(x)| < f(a) < |f(x)| for all x € [a, 6]. 
(a) The function f is integrable on every interval [a, b] because it is built up from 
integrable functions as allowed by Theorem 5.8, page 232. 


(b) A look at the graph shows iia f=-1. 
(c) A look at the graph shows [ lad f= =n: 
All parts are basic elementary calculus calculations. 


Suppose f(x) < g(x) holds except at some finite list 71, v2, ... Yn of points in 
[a, b]. If we redefine f at these points (we could set f(x:) = g(x), for instance, 
without changing the integral) then f(a) < g(a) holds throughout [a, b], and so 
Corollary 5.6 gives the desired result. 


Integrability 


1. All of the concocted functions are continuous. 


10. 


11. 


13. 


Here is one possibility: Set h = €/100 and use P = {0,h, 1—h, 1+h,..., 
10 — h, 10}. 

Given any box sum for if; f with value less than ¢, add (if necessary) one more 
point at x = 1 to get another box sum with value less than «. This same box sum 
works for le f; just ignore boxes to the right of x = 1. 

Given € > 0, choose an integer N > 0 so that 1/N < e. Supposen > m > N. 
Then J, and J,, are Riemann sums for partitions P,, and Pm such that ||Pn|| < 
dn and ||Pm|| < 5m. Because we chose the 6; to be decreasing, we have both 
On < dn and dm < dn. Thus both P,, and P,, have norm less than dy, and so 
[In — Im| < 1/N < e, as desired. 

Let m; and M; be the inf and sup of f on [x:-1,x:]. For each i, it is clear that 
mi < f(si) < Mi, and hence that m;Ax; < f(si)Axv; < M;Az;. Summing 
over 7 gives the desired result. 

The function fig is constant on each partition subinterval, and is therefore integrable 
on [a, 6] by Theorem 5.8, page 232. By construction, moreover, the integral ss foig 
has the same value as the upper sum US (f, P). 

It is also clear that foig(x) > f(a) for all « € [a,b], and so (by Corollary 5.6, 
page 228) we have 


us(s,P)= f fme> f t= 


The proof that LS (f, ) underestimates J is almost identical. 
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5.4 Some Fundamental Theorems 


1. 


11. 
15. 


The first claim follows immediately from the mean value theorem for integrals. 
Finding examples for the second part is easy. 


The new average value is 5 - 3 + 7 = 22; the average value behaves “linearly.” 
(a) b= 9/4. 
(b) b=2. 
(c) b= V3 & 1.732. 
(d) b®& 1.094. 


Let M be the maximum value of f. Then h(x) = M — f(x) > 0 forall x € [a, 6]. 
The hypotheses imply that ae h = 0; hence h(x) = 0 for all x € [a, db}. 


The result need not hold if f is not continuous. Consider the function f given by 
f(0) = 1and f(x) = 0 on (0, 1]. 


The hypothesis means that F'(b) = He f = 0?/2 for all x. Thus f(b) = F’(b) =b 
for all b. 


The interval [0, 8/3] is one possibility. 


(a) The uniform limit function f is continuous on [0, 1] and hence is also inte- 
grable there. 


(b) For given € > 0, choose N such that | f(x) — f(x)| < € for alln > N. Now 


[w-f4- [mals [lm-si< [ewe 


which completes the proof. 
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